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METHODS OF USE OF THE EN ZYMES OF M YCOTmOLSYNTOffiSIS 

FIELD OF THE INVENTION 

[00011 The invention relates generaUy to isolation and identification of three famihes of 
enzymatic compounds produced by bacteria and involved in the steps of mycothiol 
biosynthesis and, more specifically, to identification of MshC, MshD and MshA and 
methods of use thereof, especially for use in drug discovery and disease control. 

BACKGROUND INFORMATION 

[00021 Glutathione (GSH) is the dominant low molecular weight thiol in most 
eukaryotes and Gram-negative bacteria, and it plays a key role in protection of the cell 
against oxygen toxicity and electrophiUc toxins. However, most gram-positive bacteria, 
including many strict aerobes, do not produce glutathione. Yet aerobic organisms are 

subjected to oxidative stress from many sources, including atmospheric oxygen, basal 
metaboUc activities, and, in the case of pathogenic microorganisms, toxic oxidants from the 
host phagocytic response intended to destroy the bacterial invader. 

[00031 Actinomycetes, including Streptomyces and Mycobacterium, do not make GSH 
but produce instead millimolar levels of mycothiol (MSH, AcCys-GlcN-Ins), an unusual 
conjugate of N-acetylcysteine (AcCys) with lD-m3;o-inosityl 2-amino-2-deoxy-flr-D- 
glucopyranoside (GlcN-Ins). The biochemistry of mycothiol appears to have evolved 
completely independently of that of glutathione. However, it has akeady been estabUshed 
that the metaboUsm of mycothiol parallels that of glutathione metaboUsm in several 
enzymatic processes. First, formaldehyde is detoxified in glutathione-producing organisms 
by NAD/glutathione-dependent formaldehyde dehydrogenase (L. UotUa, et al. (1989) in 
Glutathione: Chemical, Biochemical, and Medical Aspects - Part A (D. Dolphin, et al., 
Eds.) pp 517-551, John WUey & Sons, et al.). An analogous process involving 
NAD/mycothiol-dependent formaldehyde ddiydrogenase has been identified in the 
actinomycete Amycolatopsis methanolica (M. Misset-Smits, et al. (1997) FEBS Lett. 
409:221-222). This enzyme has been sequenced (A. Norin, et al. (1997) Eur. J. Biochem. 
248:282-289). 
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[0004] The second aizymatic process involves a mycothiol homolog of glutathione 
reductase recently cloned from M. tuberculosis and expressed inM smegmatis (M.P. Patel, 
et al. (1999) J. Amer. Chem. Soc. 120:1 1538-1 1539; M.P. Patel et al. (1999) Biochemistry 
3^:11827-11833; MJ». Patel et al (2001) Biochemistry ^3\\9-3\26). The reductase is 
reasonably specific for the disulfide of mycothiol, but is also active with the disulfide of 
AcCys-GlcN, the desmyo-inositol derivative of mycothioL 

[0005] A general mycothiol-dependait detoxification process has been described in 
M. smegmatis in which MSH forms S-conjugates (MSR) with reactive electrophiles, 
including some antibiotics, and MSR is subsequently degraded by the enzyme mycothiol S- 
conjugate amidase to produce GlcN-Ins and AcCySR, a metcapturic acid, which is excreted 
from the cell; in MSR R is derived fixjm the electrophile (Newton, et al. (2000) 
Biochemistry 32: 1 0739- 1 0746). 

[0006] The biosynthesis of MSH has been identified as involving four steps: (1) 
formatiqn of GlcNAc-Ins; (2) deacetyladon of GlcNAc-Ins to produce GlcN-lns; (3) 
Ugation of GlcN-Ins to Cys to produce Cys-GlcN-Ins; (4) acetylation of Cys-GlcN-Ins by 
acetyl-CoA to produce MSH (Bomemann, et al. (1997) Biochem. J. 325:623-629; 
Anderberg, et al. (1998) J. Biol. Chem. 275:37317-37323; Newton, et al. (2000) J. 
Bacteriol. 182.6958-6963). The genes encoding these biosynthesis steps have been 
designated msHA, mshB, mshC, and mshD but the only biosynthetic gene identified thus far 
is the mshB gene encoding the deacetylase MshB (Newton, et al. (2000) J. Bacteriol. 
182:6958-6963). 

[0007] The structure of mycothiol, 1-D-myo-inosityl 2-(iV-acetyl-L-cy5teinyl)amido-2- 
deoxy-a-D-glucopyranoside (AcCys-GlcN-Ins), makes it resistant to heavy-metal-catalyzed 
autoxidation (Newton et al. 1995) and it appears to have fimctions analogous to those of 
glutattiione. A mycothiol-dependent formaldehyde dehydrogenase has been identified 
(Misset-Smits et al. 1997; Norin et al. 1997). Mycobacterium smegmatis mutants defective 
in MSH biosynthesis exhibit enhanced sensitivity to hydrogen peroxide and modified 
sensitivity to antibiotics (Newton et al. 1999). Alkylating agents are detoxified by 
mycothiol and the resulting S'-conjugates cleaved by an amidase to produce tiie N- 
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acetylcysteine derivative (mercapturic acid), which is excreted fix>m the cell (Newton et aL 
2000b). A mycothiol disulfide reductase maintains mycothiol in the reduced state (Patel 
and Blanchard 1999; Patel and Blanchard 2001). 

[0008] Therefore, there is a need in the art for methods and compounds useful for 
investigation of the details of the metabolism of mycothiol and comparison with the 
established roles for the metabolism of glutathione and for identification of as yet 
unidentified biosynthesis genes. 

[0009] Antibiotic resistance of pathogenic bacteria, including pathogenic actinomycetes, 
such as Af. tuberculosis, is a well-known problem faced by medical practitioners in 
treatment of bacterial diseases. Therefore, there is a need in the art for new antibiotics, 
drags and vaccines and for screening techniques to discover antibiotics, drags and vaccines 
effective to treat or prevent bacterial infections m humans and in other mammals, such as 
domestic and farm animals. 

SUMMARY OF THF^ INVENTION 

[0010] The present invention relates to isolation and characterization of MshC, MshD 
and MshA, enzymes involved in the mycothiol biosynthesis pathway and provides methods 
utilizing such enzymes. 

[0011] In one embodiment, the invention provides a method for identifying an inhibitor 
of cysteinerglucosaminyl inositol ligase (MshC). The method includes contacting a 
candidate compound with a cysteine:glucosaminyl inositol ligase in the presence of cysteine 
and a glucosaminyl inositol or a derivative thereof, xmder suitable conditions, and 
determining the presence or absence of ligation of the cysteine to the glucosaminyl inositol 
or derivative thereof In the embodiment of the invention, a substantial absence of the 
ligation is indicative of a candidate compound that inhibits activity of the ligase. In another 
embodiment, the invention provides ah inhibitor identified by the method. 

[0012] In yet another embodiment, the invention provides a method for decreasing the 
virulence of a pathogenic cysteine:glucosaminyl inositol ligase-producing bacterium in 
mammalian cells. The method includes introducing an inhibitor of cysteine.glucosaminyl 
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inositol ligase activity into the bacterium and observing the effect on the activity of the 
ligase. Where ttie intracellular presence of the inhibitor decreases activity of the ligase, 
mycothiol biosynthesis by the bacterium is also decreased, as compared with untreated 
control bacterium. 

[0013] In still anotha: embodiment, the invention provides a method for increasing 
sensitivity of a patiiogenic cysteine:glucosaminyl inositol ligase-producing bacterium in 
mammalian cells to an antibiotic. The method includes introducing an inhibitor of 
cysteinetglucosaminyl inositol Ugase activity into the bacterium. The intracellular presence 
of the inhibitor decreases activity of the ligase, thereby decreasing mycothiol biosynthesis 
by the bacterium in said mammalian cells as compared with untreated control bacterium so 
as to increase sensitivity of the bacterium to an antibiotic. 

[0014] The invention also provides a method for inhibiting growth of a glucosaminyl 
inositol-producing bacterium in a mammal. The method includes administering an effective 
amount of an inhibitor of intracellular cysteine:glucosaminyl inositol Ugase to the mammal, 
thereby inhibiting growth of flie bacterium in the manmial. 

[0015] The invention also provides an inhibitor of cysteine:glucosaininyl inositol ligase, 
where the inhibitor is derived fiom L-cysteine by replacing the carboxyl group with a 
moiety tiiat binds the enzyme active site. 

[0016] In another embodiment, the invention provides a method for identifying an 
inhibitor of acetyl-CoArcysteinyl glucosaminyl inositol (acetyl-CoA:Cys-GlcN-Lis) 
acetyltransferase (MshD). The method includes contacting a candidate compound with an 
acetyl-CoA:Cys-GlcN-Ins acetyltransferase in the presence of an acetyl-CoA and cysteinyl 
glucosaminyl inositol (Cys-GlcN-Ihs) or a derivative thereof, under suitable conditions and 
determining the presence or absence of a transfer of acetyl to the Cys-GlcN-Ins or a 
derivative thereof. In the embodiment of the invention, the substantial absence of a transfer 
of acetyl is indicative of a candidate compound that inhibits activity of the acetyltransferase. 
In another embodiment, the invention provides an inhibitor identified by the method. 
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[0017] The invention also provides a method for decreasing the virulence of a 
pathogenic acetyl-CoA:Cys-GlcN-Ins acetyltransferase-producing bacterium in mammalian 
cells. The method includes introducmg an inhibitor of acetyl-CoA:Cys-GlcN-Ins 
acetyltransferase activity into the bacterimn, where the intracellular presence of the inhibitor 
decreases activity of the acetyltransferase. Such a decrease also deoreases mycothiol 
biosynthesis by the bacterium as compared with untreated control bactaium. 

[0018] In another embodiment, the invention provides a method for increasing 
sensitivity of a pathogenic acetyl-CoA:Cys-GlcN-Ihs acetyltransferase-producing bacterium 
in mammahan cells to an antibiotic. The method includes introducmg an inhibitor of 
endogenous bacterial acetyltransferase activity into the bacterium, where the intracellular 
presence of the inhibitor decreases activity of the acetyltransferase. Such a decrease in 
activity also decreases mycotiiiol biosynthesis by the bacterium in said mammaUan cells as 
compared with untreated control bacterium so as to increase sensitivity of the bacterium to 
an antibiotic. 

[0019] In still another embodiment, the invention provides a method for inhibiting 
growth of an acetyl-CoA:Cys-GlcN-Ins-producing bacterium in a mammal. The metiiod 
includes administering to the mammal an effective amount of an inhibitor of intracellular 
acetyl-CoA:Cys-GlcN-Ins acetyltransferase, thereby inhibiting growth of the bacterium in 
the mammal. 

[0020] In yet another embodiment, the invention provides a method for identifying an 
inhibitor of MshA glycosyltransferase (MshA). The metiiod includes contacting a candidate 
compound with a mycothiol-producing bacterium under suitable conditions, and 
determining the presence or absence of lD-7?jyo-inosityl 2-acetamido-2-deoxy-a-D- 
glucopyranoside (GlcNAc-Ins) within the mycothiol-producing bacterium. A substantial 
absence of GlcNAc-Ins within the bacterium is indicative of a compound that inhibits 
activity of the glycosyltransferase. In another embodiment, the invention provides an 
inhibitor identified by the method. 

[0021] In another embodiment, the invention provides a method for decreasing the 
virulence of a pathogenic MshA glycosyltransferase-producing bacterium in mammalian 
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cells. The method includes introducing an inhibitor of MshA glycosyltransferase activity 
into the bacterium. The intracellular presence of the inhibitor decreases activity of the 
glycosyltransferase, thereby decreasing mycothiol biosynthesis by the bacterium as 
compared with untreated control bacterium. 

[0022] hi still another embodiment, the invention provides a method for increasing 
sensitivity of a pathogenic MshA glycosyltransferase-producmg bacterium in mammaUan 
cells to an antibiotic. The method includes introducing an inhibitor of endogenous bacterial 
glycosyltransferase activity into the bacterium. The mtracellular presence of the inhibitor m 
the method if the invention decreases activity of the MshA glycosyltransferase and thereby 
decreasing mycothiol biosynthesis by the bacterium in said mammaUan cells as compared 
with untreated control bacterium. The decrease in mycothiol biosynthesis increases 
sensitivity of the bacterium to an antibiotic. 

[0023] The invention also provides a method for inhibiting growth of a GlcNAc-his- 
producing bacterium in a mammal. The method includes administering an effective amount 
of an inhibitor of intracellular MshA glycosyltransferase to the manunal. Such 
administration inhibits growth of the bacterium in tihe mammal. 

[0024] In yet another embodiment, the invention provides a method for identifying an 
inhibitor of mycothiol biosynthesis. The method includes contacting a candidate compound 
for inhibition of MshC. MshD or MshA or a combination thereof with a mycothiol- 
producing bacterium under suitable conditions, and determining the presence or absence of 
mycothiol within the bacterium. A substantial absence of mycothiol within the bacterium is 
indicative of a compound that inhibits activity of the MshC, MshD or MshA and therefore 
inhibits mycothiol biosynthesis, hi another embodiment, the mvention provides an inhibitor 
identified by the method. 

[0025] hi yet another embodiment, the invention provides a method for mcreasing 
sensitivity of a pathogenic mycothiol-producmg bacterium in mammalian cells to an 
antibiotic. The method includes inti-oducing an inhibitor of endogenous bacterial mycothiol 
biosynthesis into the bacterium. The intracellular presence of the inhibitor decreases 
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mycothiol biosynthesis by the bacterium in the manunalian cells as compared with 
untreated control bacterium so as to increase sensitivity of the bacterium to an antibiotic. 

[0026] In another embodiment, the invention provides a live mutant actinomycete, 
whose genome comprises a disruption in an endogenous gene or genes encoding a 
mycothiol biosynthesis enzyme. The mycothiol biosynthesis enzyme gene is selected from 
mshC, mshD or mshA. The disnqption prevents function of an mycothiol biosynthesis 
enzyme while cell surfece proteins and lipids are substantially unaffected, and wherein said 
disruption results in said mutant actinomycetes exhibiting transient survival in mammalian 
white blood cells for an immune response-raising period of time. 

[0027] In still another embodknent, the invention provides a purified 
cysteine:glucosaminyl inositol ligase with an amino acid sequence with 35% or more 
sequence identity to SEQ ID NO: 2 or conservative variations thereof, and which has 
cysteine:glucosaminyl inositol ligase activity. 

[0028] The invention also provides a purified acetyl-CoA:Cys-GlcN-Ins 
acetyltransferase with an amino acid sequence with 35% or more sequence identity to SEQ 
ID NO: 15 or conservative variations thereof, and which has acetyl-CoA:Cys-GlcN-Ins 
acetyltransferase activity. 

[0029] In yet another embodiment, the invention provides a purified MshA 
glycosyltransferase with an amino acid sequence with 35% or more sequence identity to 
SEQ ID NO: 19, and conservative variations thereof, and which has MshA 
glycosyltransferase activity. 

[0030] In still another embodiment, the invention provides an expression vector 
comprising polynucleotides of mshA, mshB, mshC and mshD. In the expression vector, the 
polynucleotide of mshA is SEQ ID NO: 49, the polynucleotide of mshC is SEQ ID NO: 1 
and the polynucleotide of mshC is SEQ ID NO: 48. 

[0031] In another embodiment, the invention provides a method for identifying an 
inhibitor of cysteine: glucosaminyl inositol Ugase (MshC). The method includes contacting 
a candidate compound with a cysteine:glucosaminyl inositol Ugase in the presence of a 
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cysteine; a glucosaminyl inositol or a derivative thereof and ATP, under suitable conditions, 
and assaying for the generation of pyrophosphate. The substantial absence of 
pyrophosphate, as determined by the assay, is indicative of a candidate compound that 
inhibits activity of the ligase. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0032] Figure 1 is a schematic representation of the biosynthesis pathway for mycothiol. 

[0033] Figure 2 A shows the nucleotide sequence of mshC of M smegmatis (SEQ ID 
NO: 1). Figure 2B shows the corresponding deduced amino acid sequence (SEQ ID NO: 2) 
of SEQ ID NO: 1 . Figure 2C shows the nucleotide sequence of mshC ofM. tuberculosis 
(Rv 213bc) (TubercuUst database) (SEQ ID NO: 3) and Figure 2D shows the corresponding 
deduced amino acid sequence (SEQ ID NO: 4) of SEQ ID NO: 3. 

[0034] Figures 3A-D are graphs of the chromatographic profiles for purification of 
MshC: Figure 3A shows the chromatographic profile of DEAE ion exchange; Figure 3B 
shows the chromatographic profile of the hydroxyl apatite column; Figure 3C shows the 
chromatographic profile of Reactive Brown 10 afiSnity chromatography; and Figure 3D 
shows the chromatographic profile of Sephadex G-lOO gel filtration. 

[0035] Figure 4 shows an alignment of the amino acid sequences of orthologs of MshC. 
Row 1 shows MshC of M. smegmatis (SEQ ID NO: 2) (nucleic acids 1222384-1223620 of 
unfinished M smegmatis genomic sequence (TIGR)); Row 2 shows the ortholog of M 
tuberculosis (Rv2130c) (TubercuUst database) (SEQ ID NO: 4; Row 3 shows the ortholog 
of C. striatum (SEQ ID NO: 5) (GenBank Accession # AAG03366); and Row 4 shows the 
ortholog of & coelicolor (GenBank Accession No. CAC 36366) (SEQ ID NO: 6). 

[0036] Figure 5 shows an alignment of the amino acid sequaices of MshC (Row 1) with 
CysS of M tuberculosis (Row 2)(Rv3580c, GenBank Accession # NP_218097) (SEQ ID 
NO: 12) and CysS ofE. coli (Row 3)(GenBank Accession # NP_308615) (SEQ ID NO: 
13). 
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(00371 Figure 6 is a gr^h showing the pracent survival of the invention mutants as a 
function of peroxide concentration. Closed circles represent mc^l 55, open squares- 164, 
open circles - 49, upward closed triangles- Tnl, and downward open triangles represent 
Tii2. 

[00381 Figure 7 is a sdiematic representation of the chemistry of cysteinyl-tRNA 
synthetase and Cys:GlcN-Ins ligase reactions. 

[00391 Figure 8 shows an aligmnait of mycothiol synthase (MshD) from M. 
tuberculosis (SEQ ID NO: 14). M. smegmatis (SEQ ID NO: 15), M. leprae (SEQ ID NO: 
16). S. coelicolor A(3)2 (SEQ ID NO: 17), and C. diphtheriae (SEQ ID NO: 18); "~ 
pfem00583~" denotes two regions with similarity to the acetyltransferase consensus 
domaiiL- 

[00401 Figure 9 shows a section of M smegmatis cosmid 3269 showing the insertion 
site of the <KAN-2> transposon in the mshD gene (crosshatched) as defined by the mosaic 
end sequences (ME). Sequencing of &e M. smegmatis genomic DNA commenced with the 
<BCAN-2> reverse primer RP-1. The segments corresponding to M. tuberculosis open 
reading frames Rv0818 and Rv0819 are denoted as such. 

[00411 Figure lOA shows the amino acid sequence of MshD of M tuberculosis 
(Rv0819) (SEQ ID NO: 14); Figure lOB shows the amino acid sequence of MshD of M 
smegmatis (SEQ ID NO: 15); Figure IOC shows the amino acid sequence of MshD of M 
leprae (SEQ ID NO: 16); Figure lOD shows the amino acid sequence of MshD of S. 
coelicolor (SEQ ID NO: 17); Figure lOE shows the amino acid sequence of MshD of C. 
diphtheriae (SEQ ID NO: 18); and Figure lOF shows the nucleic acid sequence of MshD of 
M smegmatis (SEQ ID NO: 48). 

(00421 Figure 11 shows a sequence alignment for MshA from M smegmatis (SEQ ID 
NO: 19) and M. tuberculosis (SEQ ID NO: 20), with SpsA from Anaebaena sp. PCC7120 
(SEQ ID NO: 21), and PimB (Rv0557) from M. tuberculosis (SEQ ID NO: 22). Site of the 
G32D mutation in M. smegmatis mutant 49 is denoted by *. Numbering refers to the M 
tuberculosis MshA sequence. 
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[0043] Figure 12A shows ttie amino acid sequence of MshA of M smegmatis (SEQ ID 
NO: 19); Figure 12B shows the amino acid sequence of MshA of M. tuberculosis (SEQ ID 
NO: 20); and Figure 12C shows the nucleic acid sequence of MshA of M smegmatis (SEQ 
ID NO: 49). 

[0044] Figure 13 shows the gaiomic locus of M smegmatis transposon mutant 
mshA:Tn5, displaying the site occupied hy the EZ::TN<BCAN-2> transposon in the mshA 
gene (crosshatched) as defined hy Ihe mosaic end sequences (ME) and the reverse 
sequencing primer KAN-2 RP-1. The sites corresponding to the M tuberculosis HBTRv 
orthologs Rv0485 and Rv0486 are ^ven in parenthesis. 

[0045] Figure 14 is a restriction map of the pACE expression vector containing cloned 
mshC. 

[0046] Figure 15 shows chromatographic profiles for purification of MshC: (A) DEAE 
ion exchange; and (B) hydroxyl apatite. 

[0047] Figure 16 is a graph showing the effect of 1,10-phenanthioline and Zn^* on the 
activity of MshC. 

[0048] Figure 17 is a graph showing the stoichiometry of MshC Ugase reaction. 

[0049] Figure 18 shows the sequence aHgmnents of the a) First half of the Rossman fold 
of M. smegmatis MshC (SEQ ID NO: 31), M leprae MshC (SEQ ID NO: 32), S. coelicolor 
MshC (SEQ ID NO: 33), C striatum MshC (SEQ ID NO: 34), T.Jusca MshC (SEQ ID NO: 
35), M. tuberculosis CysS (SEQ ID NO: 36) and E. coli CysS (SEQ ID NO: 37) and b) 
Second half of the Rossman fold of M. smegmatis MshC (SEQ ID NO: 38). M leprae 
MshC (SEQ ID NO: 39). 5. coelicolor Msh C (SEQ ID NO: 40), C. striatum MshC (SEQ 
ID NO: 41), T.Jusca MshC (SEQ ID NO: 42). M. tuberculosis CysS (SEQ ID NO: 43) and 
E. coli CysS (SEQ ID NO: 44), orthologs of MshC with conserved Zn binding residues. 

ny.TAn.ED DESCRIPTION OF TNVENTION 

[0050] Mycothiol (iD-wyo-inosityl 2-(iV-acetylcystemyl)amido-2-deoxy-a-D- 
glucopyranoside) (MSH) is present in a variety of actinomycetes and plays an essential role 
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in a pathway of detoxification in such bacteria. Mycofliiol is comprised of iV-acetylcysteine 
(AcCys) amide linked to iD-myo-inosityl 2-amino-2-deoxy-a-D-glucopyranoside (GlcN- 
Ins) and is the major thiol produced by most actmomycetes. In the mycothiol-dependent 
detoxification process in actinomycetes, an alkylating agent is converted to a S-conjugate of 
mycothiol, the latter is cleaved to release a mercapturic acid, and the mercapturic acid is 
excreted firom the cell (Newton et al. (2000a) Biochemistry 22:10739-10746). 

[0051] Stoppmg the production of MSH should eliminate the MSH-dependent protective 
mechanisms and this makes tiie enzymes of mycotibdol biosynthesis of special interest The 
pathway of mycothiol biosynthesis involves at least four enzymes, as set forth in Figure 1. 
These enzymes are designated MshA, MshB, MshC and MshD. By the present invention, 
three of the four enzymes were isolated and identified and utilization of the identified 
sequences in methods of drug discovery and disease control are provided. MshB was 
previously identified and disclosed in U.S. AppUcation 10/297,5 12, filed December 6, 2002, 
hereby incorporated by reference in its entirety, which is a national stage application of 
PCT/USOl/19091, filed June 14, 2001, which claims priority to U.S. Provisional 
AppUcation 60/211,612, filed June 14, 2000. 

10052] A family of purified cysteine:glucosaminyl inositol ligase (MshC) polypeptides 
with catalytic ligase activity for glucosaminyl inositol (iD-myo-inosityl 2-amino-2-deoxy-a- 
Dglucopyranoside; GlcN-Ins) and cysteine or a cysteme derivative is provided. The 
cysteine:glucosaminyl inositol Ugases are characterized by having an amino acid sequence 
with 35% or more homology to SEQ ID NO: 2, for example 35% 40%. 45%, 50%, 55%, 
60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or 100% identity thereto, and by having 
cysteine:glucosammyl inositol Ugase activity. The members of the family of ligases 
catalyze Ugation of cysteine to a glucosaminyl inositol or a derivative thereof For example, 
the Ugase catalyzes ATP-dependent Ugation of L-cysteine to iD-wyo-inosityl 2-amino-2- 
deoxy-<2-D-glucopyranoside. In one embodiment, the glucosaminyl inositol is a precursor 
of mycothiol (e.g., in a mycothiol producing bacterium). 

[0053] In one embodiment the Ugase has an amino acid sequence as set forth in SEQ ID 
NO: 2, SEQ ID NO: 4, SEQ ID NO: 5 or SEQ ID NO: 6 and catalyzes Ugation of cysteine 
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to a glucosaminyl inositol or a derivative thereof, especially ATP-dependent Ugation of the 
cysteine to a glucosaminyl inositol or a derivative thereof. In one embodiment the 
glucosaminyl inositol or derivative to be Ugated to the cysteine by the ligases is iD-myo- 
inosityl 2-amino-2-deoxy-o-D-glucopyranoside (GlcN-Ins). The Ugase may be encoded by 
a polynucleotide comprising a nucleic acid sequence as set forth in SEQ ID NO: 1. 

I00S4] As used herein, the term "glucosaminyl inositol derivative" means iL- or iD- 
myc^-inosityl 2-amino-2-deoxy-a-D-glucopyranoside, as well as 2-amino-2-deoxy-a-D- 
glucopyranosides derived fiom alcohols other than iD-myo-inosityl. 

[0055] As used herein, the term "cysteine" means the isomer of cysteine. 

[00561 AdditionaUy, there is provided a family of purified acetyl-CoA:cysteinyl 
glucosaminyl inositol (acetyl-CoA:Cys-GlcN-Ins) acetyltransferase (MshD) polypeptides 
with acetyltransferase activity for cysteinyl glucosaminyl inositol and acetyl-CoA. The 
acetyl-CoA:Cys-GlcN-Ins acetyltransferases are characterized by having an amino acid 
sequence with 35% or more homology to SEQ ID NO: 1 5, for example 35%, 40%, 45%, 
50%, 55%. 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or 100% identity thereto, and by 
having acetyltransferase activity. The members of the family of acetyltransferases catalyze 
transfer of an acetyl group to a Cys-GlcN-Ins or derivative thereof, resulting in the 
production of mycothiol (AcCys-GlcN-Ins). The acetyltransferase may be encoded by a 
polynucleotide comprising a nucldc acid sequence as set forth in SEQ ID NO: 48. 

[0057] Additionally, a family of purified MshA glycosyltransferase (MshA) 
polypeptides with glycosyltransferase activity is provided. The MshA glycosyltransferases 
are characterized by having an amino acid sequence with 35% or more homology to SEQ 
ID NO: 19. for example, 35%, 40%. 45%. 50%, 55%, 60%. 65%, 70%. 75%. 80%, 85%, 
90%. 95% or 100% identity thereto, and by havmg glycosyltransferase activity. The 
members of the family of acetyltransferases catalyze production of iD-wiyo-inosityl 2- 
acetamido-2-deoxy-a-D-glucopyranoside (GlcNAc-lns). In one embodiment, the GlcNAc- 
his is a precursor of mycothiol (e.g., in a mycothiol producing bacterium). The 
glycosyltransferase may be encoded by a polynucleotide comprising a nucleic acid sequence 
as set forth in SEQ ID NO: 49. 
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(00581 The methods described herein further elaborate the pathway involved in MSH 
biosynthesis. The invention demonstrates that GlcNAc-Ins is an intracellular MSH 
component and is formed by activity of MshA. This conversion defines an initial step m 
mycothiol biosynthesis. Further, the invention demonstrates that GlcN-Ins is an 
intracellular MSH component in M. smegmatis and is converted to Cys-GlcN-Ins by 
Cys:GlcN-Ins ligase (MshC). This conversion defines the penultimate step in MSH 
biosynthesis. Additionally, the invention demonstrates that Cys-GlcN-Ins is a precursor to 
mycothiol and is converted to mycothiol by acetyl-CoA:Cys-GlcN-Ins acetyltransferase 
(MshD) activity. This conversion defines the final step in MSH biosynthesis. 

[0059] A member of the family of polypeptide Ugases shown to be responsible for ATP- 
dependent Ugation of L-cysteme to GlcN-his to form Cys-GlcN-Ihs has been cloned firom 
M. tuberculosis genomic sequence and conresponds to an open reading firame designated 
Rv2130c and misidentified as a probable cysS2, cysteinyl-tRNA synthetase (GenBank 
Accession # NP_216646)(Cole, et al. (1998) Nature 393:537-544). The nucleic acid 
sequence encoding this protein corresponds to nucleic acids 2391213-2392457 of the M. 
tuberculosis genome encoding a protein of 414 amino acid residues. The derived amino 
acid sequence is as set forth in SEQ ID NO: 4. A BLAST search with the M. tuberculosis 
MshC sequence on GenBank revealed additional homologs in Corynebacterium striatum 
(Accession # AAG03366) (SEQ ID NO: 5) and Streptomyces coelicolor (Accession # 
CAC36366) (SEQ ID NO: 6). The sequences for these MshC proteins are included in 
Figure 4 and have 54-58% identity to the M tuberculosis and Af. smegmatis sequences. 
Orthologs of M tuberculosis MshC were also found at the Sanger Centre in M. leprae 
(82% identity, S.T. Cole et al. (2001) Nature 409, 1007), M bavis (96% identity; website 
Sanger.org/Projects/M_bovis), and Corynebacterium diphtheriae (54% identitjr, website 
Sanger.6rg/Projects/C_diphtheriae), and at TIGR in M. avium (81% identity). All of these 
organisms belong to genera of bacteria that have been shown to produce MSH (Newton et 
al., (1996) J. Bacterial. 1 78, 1990-1995). This sequence homology indicates that MSH 
biosynthesis in these organisms utilizes a GlcN-Ins Ugase (MshC) in the same manner as 
that described here for M. smegmatis. Other actinomycetes that produce MSH are also 
expected to have a Cys:glcN-his Ugase (MshC) gene homologous to SEQ ID NO: 1 or 3. 
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[00601 A membCT of the femily of polypeptide acetyltransferases shown to be 
responsible for acetylation of Cys-GlcN-Ins to fonn mycothiol has been cloned from M 
tuberculosis genomic sequence and corresponds to an open reading frame designated 
Rv0819. The nucleic acid sequence encoding this protein corresponds to nuclac acids 
911736-^12680 of the M. tuberculosis genome encoding a protein of 315 amino acid 
residues. The derived amino acid sequence is as set forth in SEQ ID NO: 14. Sequence 
searches with the M. smegmatis mshD gene revealed orthologs in other actinomycetes 
including M tuberculosis H37Rv. The M tuberculosis gene CRv0819) was cloned, 
expressed in E. coli, and shown to code for mycothiol synthase activity. Other 
actinomycetes that produce MSH are also expected to have an acetyl-CoA:Cys-GlcN-Ins 
acetyltransferase (MshD) gene homologous to SEQ ID NO: 14 or 15. 

ft 

[00611 A member of the femily of polypeptide glycosyltiransferases shown to be 
responsible for formation of GlcNAc-Ins has been cloned from M tuberculosis genomic 
sequence and corresponds to an open reading frame designated Rv0486. The nucleic acid 
sequence encoding this protein corresponds to nucleic acids 575348-576787 of the M. 
tuberculosis genome encoding a protein of 480 amino acid residues. The derived amino 
acid sequence is as set forth in SEQ ID NO: 20. Other actinomycetes that produce MSH 
are also expected to have an MshA glycosyltiransferase (MshA) gene homologous to SEQ 
ID NO: 19 or 20. 

(00621 The enzyme famiUes as set forth above are utiUzed in the methods of the 
invention. 

[00631 Members of tiie mycothiol biosynthesis femiUes of enzymes are formed in vivo 
by bacteria as part of a mycothiol biosynthesis pathway, most usually in bacteria 
characterized by intracellular production of mycothiol. Additional bacteria from which the 
mycothiol biosynthesis polypeptides can be obtained include actinomycetes. such as M. 
smegmatis. M. tuberculosis, M. leprae. M. bovis. M. intracellulare. M. africanum. M. 
marinarum. M. chelonai. Corynebacterium diphtheria. Actinomycetes israelii, M. avium 
complex (MAC) (Holzman, in Tuberculosis ed. by Rom and Gary (Little, Brown, and 
Company, 1996) Chapter 56), M. ulcerans, M. abscessus, or M scrqfulaceum. and the like. 
Actinomycetes that can be used for this purpose include antibiotic-producing bacteria. 
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Homologous non-mycobacterial ligase proteins can also be obtained from the antibiotic 
producers Streptomyces lincolnensis.Amycolatopsis mediterranei. Amycolatopsis orientalis. 
Streptomyces lavendiOae. Streptomyces coelicolor. Streptomyces rochei, the polyketide 
erythromycin antibiotic producer Saccharopolyspora erythraea, Streptomyces 
violaceomber Tu7, Streptomyces diastochromogens subsp. variabilicolor, and Streptomyces 
sp. OM-6519. 

[0064] Inhibitors of the mycothiol biosynthesis ligases, acetyltransferases and 
glycosyltransferases are particularly well suited as antibiotics against mycothiol-producing 
bacteria since mycothiol production will cease in the absence of the intermediate products. 
GlcNAc-Ins or Cys-GlcN-Ins, produced by activity of the mycothiol biosynthesis enzymes. 
Accordingly, in one embodiment of the present invention, there are provided methods for 
identifying an inhibitors of MshC, MshD, MshA and mycothiol biosynthesis. 

[00651 In one embodiment, the invention provides a method for identifying an inhibitor 
of cysteinerglucosaminyl inositol Ugase. The method includes contacting a candidate 
compound with a cysteine:glucosammyl inositol Ugase in the presence of cysteine and a 
glucosaminyl inositol or derivative thereof under suitable conditions, and detennining the 
presence or absence of ligation of cysteine to the glucosaminyl inositol or derivative 
thereof. For example, if the test compound is a putative inhibitor of Ugase activity of the 
polypeptide Ugase, the absence of Ugated Cys-GlcN-Ins indicates the candidate compound 
is an inhibitor of the activity of the polypeptide as a Ugase. Similarly, if the test compound 
is assayed as a putative inhibitor of MshC in mycothiol-producing bacteria, the presence of 
excess GlcN-Ins indicates that the candidate compound is an inhibitor of the activity of the 
ligase for Unkage of cysteine or a cysteine derivative to a glucosaminyl inositol. On the 
other hand, in such assays, the presence of Cys-GlcN-Ins indicates that the test compound is 
not an mhibitor of MshC activity. In another embodhnent, the candidate compound is 
contacted witii a cysteinerglucosaminyl inositol Ugase in the presence of cysteine, a 
glucosammyl inositol or derivative thereof, ATP and pyrophosphatase, under suitable 
conditions, and detecting the resulting inorganic phosphate with a colorimetiic or 
fluorometric assay known in the art. 
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10066] In one embodiment, the cysteine-.glucosaminyl inositol ligase of the method is 
characterized as having an amino acid sequence with 35% or more sequence identity to 
SEQ ID NO: 2 or 4, or conservative variations thereof, for example, 35%, 40%, 45%, 50%, 
55%, 60%, 65%, 70%. 75%, 80%, 85%, 90%, 95% or 100% identity thereto, and 
cysteinerglucosaminyl mositol Ugase activity, hi another embodiment, a derivative of 
glucosaminyl inositol is used in the method of the invention. Such derivatives may include, 
but are not limited to D-glucosamine. A glucosaminyl inositol substrate for use in screening 
for inhibitors of Ugase activity is lD-m;;o-inosityl 2-amino-2-deoxy-flr-D-glucopyranoside 
(GlcN-Ins). In another aspect of the invention, the suitable conditions include the presence 
of ATP during Ugation of cysteine to the glucosaminyl inositol or derivative thereof. 

[00671 As set forth above, the polypeptides can be derived from bacteria, including 
actinomycetes. In one embodunent of identifying an inhibitor of cysteinetglucosaminyl 
inositol Ugase, the Ugase is produced in an actinomycete. Additionally, the invention 
provides an inhibitor of cysteinerglucosaminyl inositol Ugase identified by the method of 
the invention. 

[0068] • Similarly, the invention provides a method for identifying an inhibitor of acetyl- 
CoAxysteinyl glucosaminyl inositol (acetyl-CoA:Cys-GlcN-Ins) acetyltransferase (MshD). 
The method includes contacting a candidate compound with an acefyl-CoA:Cys-GlcN-his 
acetyltransferase in the presence of a cysteinyl glucosaminyl inositol (Cys-GlcN-his) and 
acetyl-CoA, under suitable conditions and determining the presence or absence of a transfer 
of acetyl to the Cys-GlcN-his. hi the embodiment of the invention, the substantial absence 
of a transfer of acetyl is indicative of a candidate compound tiiat inhibits activity of tiie 
acetyltransferase. For example, if the test compound is a putative inhibitor of 
acetyltransferase activity of apolypeptide acetyltransferase, the absence of an acetylated 
Cys-GlcN-his (mycothiol) mdicates the candidate compound is an inhibitor of tiie activity 
of the polypeptide as an acetyltransferase. Similarly, if the test compound is assayed as a 
putative mhibitor of MshD in mycothiol-producing bacteria, the presence of excess Cys- 
GlcN-his indicates tiiat the candidate compound is an inhibitor of the activity of tiie 
acetyltransferase for Unkage of an acetyl group to a Cys-GlcN-Ins. On the otiier hand, in 
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such assays, the presence of mycothiol indicates that the test compound is not an inhibitor 
of MshD activity. 

[00691 In one embodiment, the acetyl-CoA:Cys-GlcN-Lis acetyltransferase of the 
method is characterized as having an amino acid sequence with 35% or more sequence 
identity to SEQ ID NO: 14 or 15, or conservative variations thereof, for example, 35%, 
40%, 45%, 50%, 55%, 60%, 65%, 70%. 75%, 80%, 85%, 90%, 95% or 100% identity 
thereto, and acetyl-CoA:Cys-GlcN-Ins acetyltransferase activity. In one embodiment of the 
invention, the Cys-GlcN-Ins is lD-mj;o-inosityl 2-L-cysteinylamido-2-deoxy-a-D- 
glucopyranoside. 

[00701 As set forth above, the polypeptides can be obtained from bacteria, including 
actinomycetes. In one embodim«it of identifying an inhibitor of acetyl-CoA:Cys-GlcN-Ins 
acetyltransferase, the acetyltransferase is produced in an actinomycete. Additionally, the 
invention provides an inhibitor of acetyl-CoAtCys-GlcN-Ins acetyltransferase identified by 
the method of the invention. 

(00711 hi still another related embodiment of the invention, a method for identifying an 
inhibitor of MshA glycosyltransferase (MshA) is provided. The method includes contacting 
a candidate compoimd with a mycothiol-producing bacterium under suitable conditions and 
determining the presence or absence of iD-myo-inosityl 2-acetamido-2-deoxy-a-D- 
glucopyranoside (GlcNAc-Ins) within the mycothiol-producing bacterium. In the 
embodiment of the invention, the substantial absence of GlcNAc-Ins within the bacterium is 
indicative of a compound that inhibits activity of the glycosyltransferase. For example, if 
the test compound is a putative inhibitor of glycosyltransferase activity of the polypeptide 
glycosyltransferase, the absence of GlcNAc-Ins indicates the candidate compound is an 
inhibitor of the activity of the polypeptide as a glycosyltransferase. On the other hand, in 
such assays, the presence of GlcNAc-Ins indicates that the test compound is not an inhibitor 
of MshA activity. 

[0072] In one embodiment, the MshA glycosyltransferase of the method is characterized 
as having an amino acid sequence with 35% or more sequence identity to SEQ ID NO: 19 
or 20, or conservative variations thereof, for example, 35%, 40%, 45%, 50%, 55%, 60%, 
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65%, 70%. 75%, 80%, 85%, 90%. 95% or 100% identity thereto, and having MshA 
glycosyltransferase activity. 

[0073] As set forth above, the polypeptides can be obtained from bacteria, including 
actinomycetes. In one embodiment of identifying an inhibitor of MshA glycosyltransferase. 
the glycosyltransferase is produced in an actinomycete. Additionally, the invention 
provides an inhibitor of MshA glycosyltransferase identified by the method of the invention. 

[0074] ■ In still another embodiment, the invention provides a method for identifying an 
mhibitor of mycothiol biosynthesis. The method includes contacting a candidate compound 
with a mycothiol-producing bacterium, under suitable conditions, and determining the 
presence or absence of mycothiol within the mycothiol-producing bacterium. The 
substantial absence of mycothiol is indicative of a candidate compound that inhibits 
mycothiol biosyntliesis. The inhibition of mycothiol biosynthesis can be by, but is not 
limited to, inhibition of cysteine:glucosaminyl inositol Ugase, acetyl-CoA:Cys-GlcN-Ins 
acetyltransferase or MshA glycosyltransferase. Additionally, the excess or absence of 
intennediates of the mycothiol biosynthesis is indicative of an inhibitor of mycothiol 
biosynthesis. In another embodiment, the invention provides an inhibitor of mycothiol 
biosynthesis identified by the method. 

[0075] In one embodiment, the mycothiol-producing bacterium of the method is an 
actinomycete. Additionally, the invention provides an inhftitor of mycothiol biosyathesis 
idraitified by the method of the invention. 

[0076] In an alternative embodiment of the invention, methods are provided for 
decreasing the virulence in mammaHan cells of a pathogenic MshC-producing, MshD- 
producing or MshA-producing bacterium, such as an actinomycete. By virulence is meant 
the relative power and degree of pathogenicity possessed by organisms to produce disease 
as measured by clinical symptoms particular to the disease under consideration. For 
example, the virulence of M tuberculosis is measured with reference to the manifestation in 
an infected individual of the clinical symptoms recognized by a medical practitioner as 
indicative of tuberculosis. In the invention method for decreasing the virulence of 
pathogenic MshC-producing, MshD-producing or MshA-producing bacteria, an inhibitor of 
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MshC, MshD or MshA (for example, one identified by the above-described screening 
method), respectively, is introduced into the bacterium. 

[00771 Intracellular uptake of the inhibitor by the treated bacterium results in decreased 
activity of the enzyme, thereby decreasing mycothiol biosynthesis by the bacterium as 
compared with untreated control bacterium. Hence, the virulence of the treated bacterium is 
reduced. For example, for treatment of isolated mammaHan cells, the introducing can 
comprise culturing the bacterium in the presence of the inhibitor. Alternatively, for 
treatment of mammaUan cells contained in a Uving organism, the inhibitor may be 
administered systemically to the Uving organism. Pathogenic MshC-producing, MshD- 
producing or MshA-producing bacteria whose virulence can be reduced according to the 
invention methods include such actinomycetes as M. smegmatis, M. tuberculosis. M. leprae, 
M. bovis (particularly in bovine subjects), M intracellulare. M. qfncanum. andM. 
marinarum. M. chelonai, Corynebacterium diphtheriae. AcHnomyces israelii, M- avium 
complex (MAC), M ulcerans, M. abscessus, M.scrqfulaceum. and the like. 

[0078] In one embodiment of the invention, a method for decreasing the virulence of a 
pathogenic cysteine:glucosaminyl inositol Ugase-producing bacterium m mammalian cells 
is provided. Specifically, the method includes introducing an inhibitor of 
cysteine^glucosaminyl inositol Ugase activity into the bacterium and observing the effect on 
the activity of the Ugase. M the method of the invention, where the intracellular presence of 
the inhibitor causes a decrease in activity of the Ugase, mycothiol biosynthesis by the 
bacterium is also decreased, as compared with untreated control bacterium. 

[0079] The invention also provides a method for decreasing the virulence of a 
pathogenic acetyl-CoA:Cys-GlcN-Ins acetyltransferase-producing bacterium in mammalian 
ceUs. The method includes introducing an inhibitor of acetyl-CoA:Cys-GlcN-Ins 
acetyltransferase activity into the bacterium, where the intracellular presence of the inhibitor 
decreases activity of the acetyltransferase. Such a decrease also decreases mycothiol 
biosynthesis by the bacterium as compared with untreated control bacterium. 

[0080] Similarly, the invention provides a method for decreasing the virulence of a 
pathogenic MshA glycosyltransferase-producing bacterium in mammalian ceUs. The 
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method includes introducing an inhibitor of MshA glycosyltransferase activity into the 
bacterium. The intracellular presence of the inhibitor decreases activity of the 
glycosyltransferase, thereby decreasing mycothiol biosynthesis by the bacterium as 
compared witii untreated control bacterium. 

[0081] The inhibitors used in the invention methods for decreasing the virulence of a 
pathogenic MshC-producing, MshD-produdng or MshA-producing bacterium may either 
inhibit intraceUular production of the enzyme or inhibit intracellular catalytic activity of tiie 
enzyme. In one embodiment, the inhibitor inhibits intracellular production of mycothiol. 

[0082] In various embodiments of the present invention there are provided inhibitors of 
the enzymes of mycothiol biosynthesis. These inhibitors may be identified, for example, by 
the screening method set forth above. For example, tiie inhibitor can be, but is not limited 
to a polypeptide, a polynucleotide or a small molecule. 

[0083] In one embodiment, where inhibition of MshC is desired, the invention Ugase 
inhibitors are derived fi-om L-cysteine by replacing the carboxyl group with a moiety that 
binds the enzyme active site. Examples of tiie type of moiety fliat can be used to replace the 
carboxyl group in L-cysteine to form an inhibitor of tiie Ugases are selected from moieties 
having tiie chemical stiuctijre CH20P0(0H)0R, wherein R is derived eitiier from AMP or 
from a cycUtol bearing one or more branched or unbranched alkyl residues. In anotiier 
embodiment where inhibition of MshC is desired, tiie invention mhibitors are derived from 
L-cysteine by replacing tiie carboxyl group tiierein witii a moiety having tiie chemical 
structure CONHSO2OR, wherein R is derived from AMP or R is a cycUtol bearing one or 
more branched or unbranched alkyl residues. Suitable alkyl residues for this purpose 
include, but are not limited to, tiiose containing from 1 to 10 carbons, for example 2 to 8 
carbons, 3 to 6 carbons, or 4 to 5 carbons. 

[0084] In anotiier embodiment according to tiie present invention, tiie invention inhibitor 
is an anti-sense oUgonucleotide complementary to a target region in a messenger RNA tiiat 
encodes a polypeptide having an amino acid sequence segment witii 35% or more sequence 
identity to tiie amino acid sequence of SEQ ID NO: 2 or 4, 35% or more sequence identity 
to tiie amino acid sequence of SEQ ED NO: 14 or 15 or 35% or more sequence identity to 
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the amino acid sequence of SEQ ID NO: 19 or 20. For example, the candidate compound 
can inhibit intracellular production or activity of the cysteine-.glucosaminyl inositol Ugase, 
acetyl-CoA:Cys-GlcN-Ins acetyltransferase or MshA glycosyltransferase. Suitable 
conditions for conducting invention drug screening methods are well known in the art and 
are described, for example, in the Examples below. 

100851 Altematively, an anti-sense oUgonucleotide can be designed to hybridize under in 
vivo conditions with a messenger RNA that encodes a polypeptide having an N-terminal 
amino acid sequence as set forth in SEQ ID NO: 2 or 4, or contains an amino acid segment 
as set forth in SEQ ID NOS: 7, 8 or 9, or a conservative variation thereof. 

[00861 The anti-sense oUgonucleotide can comprise from about 10 to about 60 nucleic 
acid residues, for example from 10 to about 50. or from 10 to about 40, 30 or 20 nucleic 
acid residues. 'Hybridization" refers to the process by which a nucleic acid strand joins 
with a complementary strand through base pairing. Hybridization reactions can be sensitive 
and selective so that a particular sequence of interest will join with a complementary strand 
even in samples in which it is present at low concentrations. Suitable intracellular 
conditions for hybridization of an anti-sense oUgonucleotide to messenger KNA will be 
determined by the particular bacterium used in the invention method. In general, the pH, 
temperature and salt concentration must be comparable to intraceUular conditions in the test 
bacteriiun. 

[00871 hi yet another embodiment according to the present invention, there are provided 
methods for increasing sensitivity to an antibiotic of a pathogenic MshC-producmg. MshD- 
producing or MshA-produdng bacterium. 

[0088] hi such a method for increasmg sensitivity of a pathogenic cysteine: glucosaminyl 
inositol Ugase-producing bacterium in mammaUan ceUs, an invention inhibitor of 
cysteine:glucosanimyl inositol Ugase activity is introduced mto the bacterium. The 
intraceUular presence of the invention inhibitor in the bacterium decreases activity of the 
Ugase, thereby decreasing mycothiol biosynthesis by the bacterium as compared with 
untreated control bacterium so as to mcrease sensitivity of the bacterium to an antibiotic. 
The inhibitor can inhibit intraceUular production of the Ugase or inhibit intraceUular Ugase 
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activity of the ligase. In one embodiment, sensitivity of the bacterium to the antibiotic is 
increased while liie bacterium is within a mammalian cell by a decreased activity of the 
Ugase in the bacterium while contained within the host mammalian cell. The inhibitor can 
be introduced into the bacterium, for example, by culturing the bacterium in the presence of 
the inhibitor. Bacteria whose sensitivity to antibiotics can be increased by practice of the 
invention methods include such pathogenic bacteria as various actinomycetes. Specific 
examples of bacteria whose sensitivity to antibiotics can be increased by the invention 
methods include M. smegmatis, M. tuberculosis. M. leprae. M. bovis. M. intracellulare. M. 
africanum. M. marinarum. M. chelonai, Corynebacterium diphtheria. Actinomyces israelii, 
M. avium complex (MAC), M. ulcerans, M. abscessus, M. scrofiilaceum, and the like. In 
another embodiment, the bacterium is an actinomycete and the inhibitor inhibits 
intracellular production of mycothiol. 

[0089] Similarly, in a method for increasing sensitivity of a pathogenic acetyl-CoA:Cys- 
GlcN-Ins acetyltransferase-producmg bacterium in mammaUan cells to an antibiotic, an 
inhibitor of acetyl-CoA:Cys-GlcN-Ins acetyltransferase activity is introduced into the 
bacterium. The intracellular presence of tiie inhibitor in tiie bacterium decreases activity of 
tiie acetyltransferase, thereby decreasing mycothiol biosynthesis by the bacterium as 
compared with untreated control bacterium so as to increase sensitivity of tiie bacterium to 
an antibiotic. The inhibitor can inhibit intiracellular production of the acetyltransferase or 
inhibit intracellular activity of flie acetyltransferase. In one embodunent, sensitivity of tiie 
bacterium to tiie antibiotic is increased while tiie bacterium is wifliin a mammalian cell by a 
decreased activity of tiiie acetyltiansferase in tiie bacterium while, contained witiiin tiie host 
mammaUan cell. The inhibitor can be introduced into tiie bacterium, for example, by 
culturing tiie bacterium in the presence of the inhibitor. 

[0090] Similarly, in a metiiod for increasing sensitivity of a pathogenic acetyl-CoA.Cys- 
GlcN-Ins acetylti^ferase-producing bacterium in mammalian cells to an antibiotic, an 
inhibitor of acetyl-CoA:Cys-GlcN-Ins acetyltransferase activity is introduced into tiie 
bacterium. The intiaceUular presence of tiie inhibitor in tiie bacterium decreases activity of 
tiie acetyltransferase, tiiereby decreasing mycotiiiol biosynthesis by tiie bacterium as 
compared witii untreated contiol bacterium so as to increase sensitivity of tiie bacterium to 
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an antibiotic. The inhibitor can inhibit intraceUular production of the acetyltransferase or 
inhibit intracellular activity of the acetyltransferase. In one embodiment, sensitivity of the 
bacterium to the antibiotic is increased while the bacterium is within a mammaUan cell by a 
decreased activity of the acetyltransferase in the bacterium while contained within the host 
mammalian cell. The inhibitor can be introduced into the bacterium, for example, by 
culturing the bacterium in the presence of the inhibitor. 

[00911 Additionally, in a method for increasing sensitivity of a pathogenic MshA 
glycosyltransferase-producing bacterium in mammaUan cells to an antibiotic, an inhibitor of 
MshA glycosyltransferase activity is introduced into the bacterium. The intracellular 
presence of the inhibitor in the bacterium decreases activity of the glycosyltransferase. 
thereby decreasing mycothiol biosynthesis by the bacterium as compared with untreated 
control bacterium so as to increase sensitivity of the bacterium to an antibiotic. The 
inhibitor can inhibit int^cellular production of the glycosyltransferase or inhibit 
intracellular activity of the glycosyltransferase. In one embodiment, sensitivity of the 
bacterium to the antibiotic is increased while the bacterium is within a mammaUan ceU by a 
decreased activity of the glycosyltransferase in the bacterium while contained withm the 
host mammaUan ceU. The inhibitor can be introduced into the bacterium, for example, by 
culturing the bacterium in the presence of the inhibitor. 

[0092] In still another embodiment, tiie invention provides a method for mcreasing 
sensitivity of a pathogenic mycothiol-producing bacterium in mammaUan ceUs to an 
antibioti:c, by introducing an inhibitor of endogenous bacterial mycothiol biosynthesis into 
the bacterium. The intiraceUular presence of the inhibitor in tiie bacterium decreases 
mycothiol biosynthesis by the bacterium as compared with untreated control bacterium so 
as to increase sensitivity of the bacterium to an antibiotic. The inhibitor can inhibit 
intraceUular production of cysteine: glucosaminyl inositol Ugase, acetyl-CoA:Cys-GlcN-Ins 
acetyltransferase or MshA glycosyltransferase or can inhibit activity of the same. The 
inhibitor can be introduced into the bacterium, for example, by culturing the bacterium in 
tiie presence of tiie inhibitor. 

[0093] Bacteria whose sensitivity to antibiotics can be increased by practice of the 
invention mettiods include such patiiogenic bacteria as various actinomycetes. Specific 
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examples of bacteria whose sensitivity to antibiotics can be increased by the invention 
methods" include M smegmatis. M. tuberculosis. M. leprae, M. bovis. M. intracellulare. M. 
africanum. M. marinarum, M. chelonai, Corynebacterium diphtheria. Actinomyces israelii, 
M. avium complex (MAC), M ulcerans, M. abscessus, M. scrojulaceum, and the like. In 
another embodiment, the bacterium is an actinomycete and the inhibitor inhibits 
intracellular production of mycothiol. 

[0094] In accordance with the above, the invention also provides a method of 
synthesizing mycothiol in vivo. By mserting mshA (for example, SEQ ID NO: 49 or a 
nucleic acid sequence encoding a protein having an amino acid sequence as set forth in SEQ 
ID NO: 19). mshB, mshC (for example, SEQ ID NO: 1 or 3) and mshD (for example. SEQ 
ID NO: 48 or a nucleic acid sequence encoding a protein having an ammo acid sequence as 
set forth' in SEQ ID NO: 15). the genes for the four enzymes of mycothiol production, into a 
plasmid and inserting the plasmid into an organism, all four enzymes are expressed. With 
this expression, mycothiol is produced by the host cell. This method may be used to 
stimulate mycothiol production in the organism or increase existing mycothiol production. 
Such an increase of mycothiol within an organism serves to increase tolerance of the 
organism to antibiotics. Such an increase in antibiotic tolerance is useful to protect 
antibiotic-producing organisms from the toxic effects of flie antibiotics tiiey produce. 

(00951 Therefore in an embodiment according to the present invention, there are 
provided methods for increasing production of antibiotic by antibiotic-producing bacteria by 
transforming tiie antibiotic-producing bacteria witii a polynucleotide that increases 
intracellular mycothiol production by the bacteria in culture. The increase in intracellular 
production of mycotiuol increases the production of antibiotic by the bacteria by increasing 
resistance of tiie bacteria to the antibiotic. Generally, in industiial appUcations where 
antibiotic is produced from bacteria for commercial purposes, the antibiotic-producing 
bacteria are cultured under conditions suitable for production of the antibiotic, and the 
antibiotic is recovered from the culture media. 

[0096] In one embodiment, the compound that increases intracellular mycothiol 
production by the bacteria is expressed intracellularly by ttie bacteria. In one embodiment, 
the bacteria is actinomycetes. For example, the actinomycetes can be transformed with a 
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polynucleotide, such as an expression vector, that encodes one or more enzymes involved in 
the mycothiol biosynthesis pathway and which produces mycothiol in culture. 
Recombinant expression of the polypeptides m cultured antibiotic-producing cells can be 
useM for increasing the resistance of the production cells to the toxic effect upon 
themselves of the antibiotics they produce. Thus, the level of antibiotics in the culture media 
can be increased without causing death of the production cells, thereby mcreasing the 
efficiency of industrial antibiotic production methods. Suitable polynucleotides that can be 
used to transform antibiotic-producing bacteria can contain, for example, SEQ ID NOS: 1, 



48 or 49. 



[0097] In yet another embodiment according to the present invention, there are provided 
Uve mutant actinomycetes, whose genomes comprise a modification in an endogenous 
enzyme of the mycothiol biosynthesis pathway and thereby reduce mycothiol synthesis. 
Appropriate modification of genes for mycothiol biosynthesis m mycobacteria can reduce 
their survival in mammaUan macrophages. Modification of any one of the endogenous 
cysteine:glucosaminyl inositol ligase gene, acetyl-CoA:Cys-GlcN-Ins acetyltransferase gene 
or MshA glycosyltransferase gene can reduce function of an endogenous 
cysteine:glucosaminyl inositol ligase. acetyl-CoA:Cys-GlcN-Ins acetyltransferase or MshA 
glycosyltransferase, respectively, while cell surface proteins and Upids should be 
substantially unaffected. As a result, invention Uve mutant actinomycetes exhibit the 
phenotype of transient survival m mammaUan white blood cells, such as murine or human 
white blood cells, for an immune response raising period of time. Such genetically 
engineered Uve mutant actinomycetes will survive in mammaUan white blood cells for a 
period of time fiom 1 to 30 days, for example from 4 to 25 days or from 5 to 20 days, but in 
no event for more than 30 days. Due to lack of intracellular cysteine: glucosaminyl mositol 
Ugase. acetyl-CoA:Cy5-GlcN-Ins acetyltransferase or MshA glycosyltransferase. the 
invention live mutant bacterium wiU feU to produce sufficient mycothiol. Hence, the 
mutant Uve bacterium is unable to survive the oxidative stress inherent m the intracellular 
environment of mammaUan white blood cells long enough to estabUsh infection m the ceUs 
or to estabUsh infection in an immunocompetent mammal containing such white blood cells. 
In one embodiment, the Uve mutant contains a modification of the acetyl-CoA:Cys-GlcN- 
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Ins acetyltransferase gene or the MshA glycosyltransferase gene and the resulting mutant is 
resistant to isoniazid. 

[00981 Thus, invention live mutant actinomycetes possess a combination of features 
desired for a vaccine effective in mammals against infection by such pathogenic 
actinomycetes as M smegmatis. M. tuberculosis. M. leprae. M. bovis. M. intracellulare. M. 
africanum. M. marinarum. M. chelonai. Corynebacterium diphtheria. Actinomycetes 
israelii, M. avium complex (MAC), M. ulcerans, M. abscessus, M. scrofiilaceum. and the 
like An individual ie.g. . an animal, such as a mouse, a fenn animal or a human) to which 
the Uve mutant is administered according to a protocol appropriate for inducing a protective 
immune response and who has not previously been infected by the counterpart wild type 
wiU mount an immune response to the vaccine, for example an immune response sufficient 
to protect the individual against future infection by the corresponding wild type Uve 
pathogen. Alternatively, the invention Uve mutant actinomycetes are useful as a research 
tool to investigate the properties desirable in a Uve mutant vaccine. 

[00991 The invention also provides amethod for inhibiting growth of Cys-GlcN-Ins- 
producing bacterium. acetyl-CoA:Cys-GlcN-Ins acetyltransferase-producing bacterium or 
GlcNAc-his-producing bacterium. In the mefliod reducing the growth of Cys-GlcN-Ins- 
producing bacterium, an inhibitor of intraceUular cysteine:glucosaminyl inositol Ugase is 
administered to the mammal, hi the method reducing the growth of acetyl-CoA:Cys-GlcN- 
Ins acetyltransferase-producing bacterium, an inhibitor of intracellular acetyl-CoAtCys- 
GlcN-his acetyltransferase is administered to the mammal. Similarly, in the method 
reducing the growth of GlcNAc-Ms-producing bacterium, an inhibitor of intracellular MshA 
glycosyltransferase is administered to the mammal. Such administration of an inhibitor of 
cysteinetglucosaminyl inositol Ugase, acetyl-CoA:Cys-GlcN-Ins acetyltransferase or MshA 
glycosyltransferase wiU inhibit growth of the bacterium in the mammal, hi one 
embodiment, the bacterium is a mycothiol-producing bacterium. Bacteria whose groAvth 
can be inhibited by the practice of the invention methods utiUzing such inhibitors can 
include such pathogenic bacteria as various actinomycetes. Specific examples of bacteria 
whose growth can be inhibited by the invention methods include Af. smegmatis, M. 
tuberculosis. M. leprae. M. bovis. M. intracellulare, M. africanum. M. marinarum. M. 
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chelonai. Corynebacterium diphtheria, Actinomyces israeliU M. avium complex (MAC). M. 
ulcerans, M. abscessus, M. scrofidaceum, and the like. 

[0100] . It appears that the difficulty in purifying the Cys:GlcN-InsUgase derives in 
fiom the presence of two different forms of the MshC protein and their tendency to 
oUgomerize. Some of the different forms apparently elute separately on some 
chromatography columns but. once separated, can rearrange to regenerate multiple forms. 
The Ugase activity eluted from Sephadex GlOO as an ^parent tetramer but the purified 
Ugase was collected as what appeared to be partially resolvable 02 and p2 dimers. Since the 
M. tuberculosis gene did not exhibit upstream homology to the M. smegmatis gene in the 
region coding for the N-terminal extension of the larger protein, this problem may be unique 
to M. smegmatis. The significance of multiple forms of the enzyme under physiologic 
conditions is not clear. 

[0101] • Bacteria generally produce one aminoacyl-tRNA synthetase for each of the 20 
natural amino acids, but some exceptions are known (T. Meinnel et al. (1995) in tRNA: 
Structure. Biosynthesis, and Function (So\l,Tt. and RajBhandary.U.. Eds.) pp 251-292. 
ASM Press, Washington, DC). However, in those cases where two aminoacyl-tRNA 
synthetases for the same amino acid have been identified, fliey have nearly identical size. 
This is not the case with CysS and CysS2 (MshC). which inM tuberculosis are comprised 
of 475 and 414 residues, respectively. There are structural similarities and differences 
between CysS and MshC from M. tuberculosis (Figure 5). CysS lacks the N-terminal 
sequence (residues 1-19, Figure 4) partially conserved in the mshC genes. The cysS and the 
mshC genes possess the "fflOH" (T. Webster et al. (1984) Science 226, 1315-131719 and 
J.J. Burbaum et al. (1990) Proteins 7. 99-111). and ''KMSKS" (C. Hountondji et al. (1986) 
Biochemistry 25:16-21 and C. Hountondji et al. (1990) Biochemistry 29, 8190-8198) 
signatures characteristic of Class I aminoacyl-tRNA synthetases (T. Meinnel et al., supra). 
However, whereas the CysS proteins contain the actual HIGH sequence near residue 40 
(Figure 5), the sequence is H(L/M)GH in the MshC proteins shown in Figure 4. The 
substitution of Leu or Met for lie is found in several Class I aminoacyl-tRNA synthetases 
(T. Meinnel. supra). The CysS protein of M tuberculosis has a 76 residue extension beyond 
the C-terminus of MshC which has substantial identity (29%) wifli corresponding 69 residue 
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extension in the CysS ftom E. coli (Figure 5). Overall, the CysS of M. tuberculosis is 42% 
identical to the CysS oiE. coli and 32% identical to MshC of M tuberculosis. Finally, 
CysS has been normally found to be active as a monomer whereas MshC is active in both 
dimer and tetramer forms. Thus, it has been discovered that CysS of M. tuberculosis is the 
true cysteinyl-tRNA synthetase and the protein originally classified as CysS2 is actually the 
Cys:GlcN-Ins Ugase (MshC) It seems logical that mshC originated from the cysS gene 
following a gene duplication. 

[01021 Figure 7 is a schematic representation of the chemistry of cysteinyl-tRNA 
synthetase and Cys:GlcN-Ins Ugase reactions. The enzyme mechanism of the aminoacyl- 
tRNA synthetases involves activation of cysteine to produce a cysteinyl-AMP intermediate 
(T. Meinnel, supra). The activated cysteinyl group is subsequently transferred to the 2' or 
y ribosyl hydroxyl at the 3' terminus of t-RNA"^ to produce the charged cysteinyl ester on 
t-RNA=^ (Figure 7). The initial step, formation of AMP-Cys (1), is the same for the Ugase 
reaction, The Ugase mechanism differs chemically by recognition of GlcN-Ins in place of 
the ribose of t-RNA'^" and in the attack of the amino group, in place of ahydroxyl group, 
upon AMP-Cys (Figure 5). A general base on the enzyme presumably functions to remove 
a proton from the hydroxyl or amino group leading to formation of a tetrahedral 
intemiediate (2) which decomposes to form an ester (3), in the case of cysteinyl-tRNA 
synthetase, or an amide (4) in the case of the Ugase. The later process is chemically more 
favorable owing to the greater nucleophiUcity of amines over alcohols and the greater 
thermodynamic stabiUty of amides relative to esters. These features, inherent in the 
chemistry of the reactions, make evolution of the Ugase ftom a cysteinyl-tRNA synthetase a 
highly plausible route. 

[0103] . A "conservative variation" in an amino acid sequence is a sequence that differs 
from a reference sequence by one or more conservative or non-conservative amino acid 
substitutions, deletions, or insertions, particularly when such a substitution occurs at a site 
that is not the active site of the molecule, and provided that the polypeptide essentiaUy 
retains its functional properties. A conservative amino acid substitution, for example, 
substitutes one amino acid for another of the same class (e.g., substitution of one 
hydrophobic amino acid, such as isoleucine, vaUne, leucine, or methionine, for another, or 
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substitution of one polar amino acid for another, such as substitution of arginine for lysine, 
glutamic acid for aspartic acid or glutamine for asparagine). One or more amino acids can 
be deleted, for example, from a Ugase polypeptide, resulting in modification of the structure 
of the polypeptide, without significantly altering its biological activity. For example, 
carboxyl-terminal amino acids that are not required for Ugase or Ugase activity of the 
mycothiol biosynthesis polypeptides can be ranoved. 

[0104] Biologically fimctional viral and plasmid DNA vectors capable of expression and 
repUcation in a host are known in the art. Such vectors are used to incorporate DNA 
sequences of the invention. In general, expression vectors containing promoter sequences 
which faciUtate the efficient transcription of the inserted eukaryotic genetic sequence are 
used in connection with the host The expression vector typically contains an origin of 
repUcation. a promoter, and a terminator, as well as specific genes that are capable of 
providing phenotypic selection of the transformed cells. 

[0105] In addition to expression vectors known in the art such as bacterial, yeast and 
mammaUan expression systems, baculovirus vectors may also be used. One advantage to 
expression of foreign genes in this invertebrate virus expression vector is that it is capable 
of expression of high levels of recombinant proteins, which are antigenically and 
functionaUy similar to their natural counterparts. Baculovirus vectors and the appropriate 
insect host ceUs used in conjunction with the vectors wiU be known to those skilled in the 
art. 

[01061 The term "recombinant expression vector" refers to a plasnud, virus or other 
vehicle known in the art that has been manipulated by insertion or incorporation of the 
cysteine:glucosaminyl inositol Ugase genetic sequences. Such expression vectors contain a 
promoter sequence that faciUtates the efficient transcription of the inserted genetic sequence 
of the host. The expression vector typicaUy contains an origin of repUcation, a promoter, as 
weU as specific genes which allow phenotypic selection of the transformed ceUs. Vectors 
suitable for use in the present invention include, but are not Umited to, the T7-based 
expression vector for expression in bacteria (Rosenberg, et al., (1987) Gene, 56:125), the 
pMSXND expression vector for expression in mammaUan ceUs (Lee and Nathans, (1988) J. 
Biol. Chem., 263:3521) and baculovirus-derived vectors for expression in insect ceUs. The 
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DNA segment can be present in flie vector operably linked to regulatory elements, for 
example, a promoter (e.g.y T7, metallothionein I, or polyhedrin promoters). 

[0107] The vector may include a phenotypically selectable marker to identify host ceUs 
which contain the expression vector. Examples of markers typically used in prokaryotic 
expression vectors include antibiotic resistance genes for ampicillin (p-lactamases), 
teti^yciine and chloramphenicol (chloramphenicol acetyltransferase). Examples of 
markers typically used in prokaryotic expression vectors used in mycobacteria can include, 
but are not limited to kanomycin and streptomycin resistance genes. Examples of such 
markers typically used in mammaliai expression vectors include the gene for adenosine 
deaminase (ADA), aminoglycoside phosphotiansferase (neo, G418). dihydrofolate 
reductase (DHFR). hygromycin-B-phosphotiransferase (HPH), thymidine kinase (TK), 
xanthine guanine phosphoribosyltransferse (XGPRT, gpt). inomycin, streptomycin and 
kanamycin. 

[01081 The isolation and purification of host cell expressed polypeptides may be by any 
conventional means such as. for example, preparative chromatographic separations and 
immunological separations such as tiiose involving the use of monoclonal or polyclonal 
antibody. 

[01091 Transformation of the host cell with the recombinant DNA may be carried out by 
conventional techniques well known to those sMUed in the art. Where the host is 
prokaryotic. such as E. coH, competent cells which are capable of DNA uptake can be 
prepared from cells harvested after exponential growfli and subsequentiy treated by 
electroporation or tiie CaCb method using procedures well known in tiie art. Alternatively, 
MgCl2 or RbCl covdd be used. 

[01101 Where the host used is a eukaryote, various methods of DNA tiansfer can be 
used. These include transfection of DNA by calcium phosphate-precipitates, conventional 
mechanical procedures such as microinjection, insertion of a plasmid encased in liposomes, 
or the use of virus vectors. Eukaryotic cells can also be cotransformed with DNA sequences 
encoding the mycotiiiol biosyntiiesis polypeptides, and a second foreign DNA molecule 
encoding a selectable phenotype, such as the herpes simplex tiiymidine kinase gene. 
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Another method is to use a eukaiyotic viral vector, such as simian virus 40 (SV40) or 
bovine papilloma virus, to transiently infect or transform eukaryotic ceUs and express the 
protein. (Eukaryotic Viral Vectors. Cold Spring Harbor Laboratory, Gluzman ed., 1982). 
Examples of mammaUan host cells include COS, BHK, 293, and CHO cells. 

lOlll] Eukaryotic host cells may also include yeast. For example, DNA can be 
expressed in yeast by inserting the DNA into appropriate expression vectors and introducing 
the product into the host cells. Various shuttle vectors for the expression of foreign genes in 
yeast have been reported (Heinemann. J. et al., (1989) Nature, 340:205; Rose. M. et al.. 
(1987) Gene, 60:237). 

[01121 The mvention also provides antibodies that are specifically reactive with 
mycothiol biosynthesis enzyme polypeptides or fragments thereof Such antibodies can be 
used as research tools to aid in isolation of mycothiol biosynthesis enzymes such as MshC, 
MshD or MshA. 

[0113] The invention also provides antibodies that consist essentially of pooled 
monoclonal antibodies with different epitopic specificities, as weU as distinct monoclonal 
antibody preparations are provided. Monoclonal antibodies are made from antigen- 
containing fragments of the protein by methods weU known in the art (Kohler, et al.. Nature, 
256:495, 1975; Current Protocols in Molecular Biology, Ausubel, et al., ed., 1989). 
Monoclonal antibodies specific for MshC polypeptide, MshD polypeptide or MshA 
polypeptide can be selected, for exanq>le. by screening for hybridoma culture supematants 
that react with the MshC polypeptide, MshD polypeptide or MshA polypeptide, but do not 
react with other bacterial Ugases, acetyltransferases or glycosyltransferases, respectively. 

[01141 Additionally, the invention provides antibodies that consist essentially of pooled 
monoclonal antibodies with different epitopic specificities, as well as distinct monoclonal 
antibody preparations are provided. Monoclonal antibodies are made from antigen 
containing Augments of a protein by methods well known in the art (Kohler. et al.. Nature, 
256:495, 1975; Current Protocols in Molecular Biology. Ausubel, et al., ed„ 1989). 
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[0115] The tenn "antibody" as used in this invention includes intact molecules as well as 
fragments thereof, such as Fab, F(ab')2 and Fv, which are capable of binding the epitopic 
determinant. These antibody fragments retain some abiUty to selectively bind with its 
antigen or receptor and are defined as follows: 

(1) Fab, the fragment which contains a monovalent antigen-binding fragment of an 
antibody molecule can be produced by digestion of whole antibody with the 

• enzyme psq)ain to yield an intact Ught chain and a portion of one heavy chain; 

(2) Fab', the fragment of an antibody molecule can be obtained by treating whole 
antibody with pepsin, followed by reduction, to yield an intact light chain and a 
portion of the heavy chain; two Fab' fragments are obtained per antibody 
molecule; 

(3) (Fab')2, the fragment of the antibody that can be obtained by treating whole 
antibody with the enzyme pepsin witiiout subsequent reduction; F(ab')2 is a 
dimer of two Fab' fragments held together by two disulfide bonds; 

(4) Fv, defined as a genetically engineered firagment containing the variable region 
of the Ught chain and the variable region of tiie heavy chain expressed as two 
chains; and 

(5) Single chain antibody ("SCA"), defined as a genetically engineered molecule 
containing the variable region of the Ught chain, the variable region of the heavy 
chain, Unked by a suitable polypeptide Unker as a genetically fused single chain 
molecule. 

[01161 Methods of making these fragments are known in tiie art. (See for example, 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory. New 
York (1988), incorporated herein by reference). 

[0117] As used in this invention, the term "epitope" means any antigenic determinant on 
an antigen to which tiie paratope of an antibody binds. Epitopic determinants usually 
consist of chemically active surfece groupings of molecules such as amino acids or sugar 
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side chains and usually have specific three-dimensional structural characteristics, as well as 
specific charge characteristics. 

[01181 Antibodies that bind to a MshC polypeptide, MshD polypeptide or MshA 
polypeptide of the invention can be prepared using an intact polypeptide or firagments 
containing small peptides of interest as the immunizing antigen. The polypeptide or a 
peptide used to immunize an animal can be derived firom translated cDNA or chemical 
synthesis and can be conjugated to a carrier protein, if desired. Such commonly used 
carriers, which are chemicaUy coupled to the peptide, include keyhole limpet hemocyanin 
(KLED. thyroglobuUn, bovine serum albumin (BSA), and tetanus toxoid. The coupled 
peptide is then used to immunize the animal (e.g., a mouse, arat, or a rabbit). 

[01191 If desired, polyclonal or monoclonal antibodies can be further purified, for 
exanq>le. by binding to and elution from a matrix to which the polypeptide or a peptide to 
which the antibodies were raised is bound. Those of skill in the art will know of various 
techniques common in the immunology arts for purification and/or concentration of 
polyclonal antibodies, as well as monoclonal antibodies (See for example, CoUgan. et al.. 
Unit 9, Current Protocols in Immunology, Wiley Interscience, 1994, incorporated herdn by 
reference). 

[01201 It is also possible to use anti-idiotype technology to produce monoclonal 
antibodies tiiat mimic an epitope. For example, an anti-idiotypic monoclonal antibody made 
to a first monoclonal antibody will have a binding domain in the hypervariable region tiiat is 
tiie "image" of the epitope bound by the first monoclonal antibody. 

[01211 In yet another embodiment, the recombinant cysteine:glucosaminyl inositol hgase 
polypeptide is a fiision protein fiirther comprising a second polypeptide portion having an 
amino acid seqi^ence from a protein unrelated to the cysteine:glucosaminyl inositol Ugase. 
Similarly, in such an embodiment the recombinant acetyl-CoA:Cys-GlcN-Ins 
acetyltransferase polypeptide is a fiision protein finther comprising a second polypeptide 
portion having an amino acid sequence fixjm a protein unrelated to tiie acetyl-CoA:Cys- 
GlcN-Ins acetyltransferase. In still another embodiment, the invention provides a 
recombinant MshA glycosyltransferase polypeptide, which is a fixsion protein fijrther 
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coiiq)rising a second polypeptide portion having an amino acid sequence from a protein 
unrelated to the MshA glycosyltransferase. Such fusion proteins can be functional in a two- 
hybrid assay. 

[0122] Another aspect of the present invention provides a substantially pure nucleic acid 
having a nucleotide sequence that encodes a cysteine:glucosaminyl inositol Ugase 
polypeptide, or a fragment thereof having an amino acid sequence with 35% or greater 
homology to one of SEQ ID NO: 2. In another embodiment, the nucleic acid encodes a 
protein having an amino acid sequence having 40% or more homologous to SEQ ID NO: 2, 
in one embodiment, the protein has an amino acid sequence at least 45% homologous to 
SEQ ID NO: 2, and in another embodiment at least 50% homologous to SEQ ID NO: 2. in 
another embodiment, the nucleic acid encodes a protein having an amino acid sequence 
having 35% or more sequence identity to the amino acid sequence of SEQ ID NO: 15, and 
in still another embodiment, the nucleic acid encodes a protein having an amino acid 
sequence having 35% or more sequence identity to the amino acid sequence of SEQ ID NO: 
19. 

[0123] Furtheraiore, in certain embodiments, the cysteine:glucosaminyl inositol Ugase, 
acetyl-CoA:Cys-GlcN-Ins acetyltransferase or MshA glycosyltransferase nucleic acid will 
compris? a transcriptional regulatory sequence, e.g. at least one of a transcriptional 
promoter or transcriptional enhancer sequence, operably linked to the mshQ mshD or mshA 
gene sequence so as to render the recombinant cysteine:glucosaminyl inositol Ugase, acetyl- 
CoA:Cys-GlcN-Ins acetyltransferase or MshA glycosyltransferase gene sequence suitable 
for use as an expression vector. 

[0124] The present invention also features transgenic non-human organisms, e.g. Uve 
mutant actinomycetes, which either express a heterologous mshC, mshD or mshA gene, or in 
which expression of their own mshC, mshD or mshA gene is disrupted. In addition to the 
other utilities of such organisms disclosed herein, such a bransgenic organism with a 
disrapted mshC gene has utiUty for overproduction of glucosaminyl inositol needed for 
screening (particularly high tiuroughput screening) for compounds that inhibit 
cysteine:glucosaminyl inositol Ugase activity in mycothiol-producing bacteria. Similarly, a 
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transgenic organism with a disrupted mshD gene has utility for overproduction of Cys- 
GlcN-Ins. 

[0125] The present invention also provides a probe/primer comprising a substantially 
purified oligonucleotide, wherein the oligonucleotide comprises a region of nucleotide 
sequence which hybridizes under stringent conditions to at least 10 consecutive nucleotides 
of sense or anti-sense sequence encoding one of the amino acid sequences encompassed by 
SEQ ID NOS: 2, 4, 14, 15, 19, 20, or naturally occurring mutants thereof. 

[0126] Yet another aspect of the invention pertains to a peptidomimetic that binds to or 
interferes with a MshC polypeptide, MshD polypeptide or MshA polypeptide and inhibits 
its respective activity. For example, a peptidomimetic that binds to or interferes with a 
MshC polypeptide can inhibit binding to or linkage of substrate cysteine to glucosaminyl 
inositol or a derivative thereof. An exemplary peptidomimetic is an analog of a peptide 
having the sequence of one of the SEQ ID NOS: 2 or 4. Non-hydrolyzable peptide analogs 
of such residues can be generated using, for example, benzodiazepine, azepine, substituted 
gama-lactam rings, keto-methylene pseudopeptides, beta-turn dipeptide cores, or beta- 
aminoalcohols. Similarly, a peptidomimetic that binds to or interferes with a MshD 
polypeptide can inhibit binding to or linkage of acetyl to substrate Cys-GlcN-Ins or a 
derivative thereof. An exemplary peptidomimetic is an analog of a peptide having the 
sequence of one of the SEQ ID NOS: 14 or 15. Also, a peptidomimetic that binds to or 
interferes with a MshA polypeptide can inhibit production of GlcNAc-Ins. An exemplary 
peptidomimetic is an analog of a peptide having the sequence of one of the SEQ ED NOS: 
19 or 20. 

[0127] Other features and advantages of the invention will be apparent firom the detailed 
description herein, and from the claims. The practice of the present invention wiU employ, 
unless otherwise indicated, conventional techniques of ceU biology, cell culture, molecular 
biology, transgenic biology, microbiology, recombinant DNA, and immunology, which are 
within the skill of the art. Such techniques are explained fully in the Uterature. See, for 
example. Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and 
Maniatis (Cold Spring Harbor Laboratory Press:1989); DNA Cloning, Volumes I and H (D. 
N. Glover ed., 1985); OUgonudeotide Synthesis (M. J. Gait ed., 1984); MulUs et al. U.S. 
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Pat No. 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); 
Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal 
Cells (R. I. Freshney, Alan R. liss. Inc., 1987); Immobilized Cells And Enzymes (IRL 
Press, 1986); B. Perbal, A Practical Guide To Molecular Cloning (1984); tiie treatise. 
Methods In Enzymology (Academic Press, Inc., N.Y.); Gene Transfer Vectors For 
MammaUan Cells (J. H. Miller and M. P. Calos eds., 1987, Cold Spring Harbor 
Laboratory); Methods In Enzymology. Vols. 154 and 155 (Wu et al. eds.). Immunochemical 
Methods In Cell And Molecular Biology (Mayer and Walker, eds.. Academic Press, 
London, 1987); Handbook Of Experimental limnunology, Volumes I-IV (D. M. Weir and 
C. C. Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1986). 

[0128] As used herein, the term "actinomycetes" and "an actinomycete" encompasses 
any bacterium of the order Actinomycetales. 

[0129] As used herdn, tiie term "nucleic acid" refers to polynucleotides such as 
deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The term 
should also be understood to include, as equivalents, analogs of eitiier RNA or DNA made 
jfrom nucleotide analogs, and, as appUcable to the embodiment being described, single 
(sense or anti-sense) and double-stranded polynucleotides. 

[0130] As used herein, the tenns "gene," "recombinant gene" and "gene construct" refer 
to a nucleic acid comprising an open reading frame encoding a cysteine:glucosaminyl 
inositol Ugase, acetyl-CoA:Cys-GlcN-Ins acetyltransferase or MshA glycosyltransferase, 
including both exon and (optionally) intx)n sequences. The term "intiron" refers to a DNA 
sequence present in a given cysteine:glucosaminyl inositol Ugase, acetyl-CoA:Cys-GlcN-Ins 
acetyltiransferase or MshA glycosyltransferase gene that is not translated into protein and is 
generally found betwem exons. 

[0131] "Homology" refers to sequence similarity between two peptides or between two 
nucleic acid molecules. Homology can be determined by comparing a position in each 
sequence which may be aligned for purposes of comparison. When a position in the 
compared sequence is occupied by the same base or amino acid, tiien tiie molecules are 
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homologous at that position. Percent homology between sequences is a function of the 
number of matching or homologous positions shared by the sequences. 

[0132] The sequence data of a test clone is aligned to the sequences in the database or 
databases using algorithms designed to measure homology between two or more sequences. 
Sequence aligmnent methods include, for example, BLAST (Altschul et aL, 1990), BLITZ 
(MPsrch) (Sturrock & Collins, 1993), and FASTA (Person & Lipman, 1988). For example, 
optimal aUgmnent of sequences for aUgning a conq)aiison window may be conducted by the 
local homology algorithm of Smith (Smith and Waterman, Adv Appl Math, 1981; Smith 
and Waterman, J Teor Biol, 1981; Smith and Waterman, J Mol Biol, 1981 ; Smith et al, J 
Mol Evol, 1981), by the homology aUgoment algorithm of Needleman (Needleman and 
Wuncsch, 1970), by the search of similarity method of Pearson (Pearson and Lipman, 
1988), by computerized implementations of these algorithms (GAP, BESTFTT, FASTA, and 
TFASTA in the Wisconsin Genetics Software Package Release 7.0. Genetics Computer 
Group, 575 Science Dr., Madison, WI, or the Sequence Analysis Software Package of the 
Genetics Computer Group. University of Wisconsin, Madison. WI), or by inspection, and 
the best alignment (i.e., resulting in the highest percentage of homology over the 
comparison window) generated by the various methods is selected. 

[01331 The term "transfection" or 'transforming" and grammatical equivalents thereof 
refers to the introduction of a nucleic acid, e.g., an expression vector, into a recipient cell by 
nucleic acid-mediated gene transfer. 'Transformation," as used herein, refers to a process in 
which acell's genotype is changed as a result of the cellular uptake of exogenous DNA or 
RNA, and, for example, the transformed cell expresses a recombinant form of one of 
cysteine:glucosaminyl inositol Ugase, acetyl-CoA:Cys-GlcN-Ins acetyltransferase or MshA 
glycosyltransferase. 

[0134] "Cells" or "cell cultures" or "recombinant host cells" or "host cells" are often 
used interchangeably as wiU be clear from the context. These terms include the immediate 
subject cells that express cysteinerglucosaminyl inositol Ugases, acetyl-CoA:Cys-GlcN-Ins 
acetyltransferases or MshA glycosyltransferases of the present invention, and, of course, the 
progeny thereof It is understood that not all progeny are exactly identical to the parental 
cell, due to chance mutations or difference in environment. However, such altered progeny 
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are included in these tenns, so long as the progeny retain the characteristics relevant to 
those conferred on the originally transformed cell. In the present case, such a characteristic 
might be the abiUty to produce a recombinant MshC polypeptide, MshD polypeptide or 
MshA polypeptide. 

(01351 As used herein, the term 'Vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been linked. The term "expression vector- 
includes plasmids. cosmids or phages, and the like, capable of synthesizing a subject MshC 
polypeptide, MshD polypeptide or MshA polypeptide encoded by the respective 
recombinant gene carried by the vector. Preferred vectors are those capable of autonomous 
repUcation and/expression of nucleic acids to which they are linked. In the present 
specification, "plasmid" and "vector" are used interchangeably as the plasmid is the most 
commonly used form of vector. Moreover, the invention is intended to include such other 
forms of expression vectors which serve equivalent functions and which become known in 
the art subsequently hereto. 

[0136] 'Transcriptional regulatory sequence" is a generic term used throughout the 
specification to refer to DNA sequences, such as initiation signals, enhancers, and 
promoters, as well as polyadenylation sites, which induce or control transcription of protein 
coding sequences with which they are operably Unked. In one embodiment, transcription of 
a recombinant mshQ mshD or mshA gene is under the control of a promoter sequence (or 
other transcriptional regulatory sequence) that controls tiie expression of the recombinant 
gene in a cell-type in which expression is intended. It wiU also be understood that the 
recombinant gene can be under the control of transcriptional regulatory sequences which are 
the same or which are different from tiiose sequences tiiat control transcription of the 
namrally occurring form of the regulatory protein. 

[0137] As used herein, a "transgenic organism" is any organism, preferably a bacteria in 
which one or more of tiie ceUs of the organism contain heterologous nucleic acid introduced 
by way of human intervention, such as by transgenic techniques well known in the art The 
nucleic acid is introduced into the cell by way of deUberate genetic manipulation, such as by 
microinjection or by infection witii a recombinant virus or a vector. The term genetic 
manipulation does not include classical crossbreeding, or in vitro fertihzation, but rather is 
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directed to the introduction of a recombinant DNA molecule. This molecule may be 
integrated within a chromosome, or it may be extrachromosomally repUcating DNA. In the 
typical transgenic organisms described herein, the transgene causes cells to express a 
recombinant form of the subject MshC polypeptides, MshD polypeptides or MshA 
polypeptides. 

10138] As used herein, the term *transgene" means a nucleic acid sequence (encoding, 
e,g., a MshC polypeptide, MshD polypeptide or MshA polypeptide), which is partly or 
entirely heterologous, i.e., foreign, to the transgenic organism or ceU into which it is 
introduced, or, is homologous to an endogenous gene of the transgenic organism or cell into 
which it is introduced, but which is designed to be inserted, or is inserted, into the 
organism's genome in such a way as to alto: the genome of the cell into which it is mserted 
(eg. , it is inserted at a location which differs from that of the natural gene or its insertion 
results in a knockout). A transgene can include one or more transcriptional regulatory 
sequences and any other nucleic acid, such as introns. that may be necessary for optimal 
expression of a selected nucleic acid. 

[01391 The term "evolutionarily related to," with respect to nucleic acid sequences 
encoding MshC polypeptides, MshD polypeptides or MshA polypeptides, refers to nucleic 
acid sequences that have arisen naturally in an organism, including naturaUy occurring 
mutants. The term also refers to nucleic acid sequences which, while derived from a 
naturally occurring MshC polypeptide, MshD polypeptide or MshA polypeptide, have been 
altered by mutagenesis, as for example, combinatorial mutagenesis, yet still encode 
polypeptides which have the Ugase activity of a cysteine:glucosaminyl inositol ligase 
polypeptide, acetyltransferase activity of an acetyl-CoA:Cys-GlcN-Ins acetyltransferase 
polypeptide or glycosyltransferase activity of a MshA glycosyltransferase polypeptide and 
may have such respective activity in modified form, such as modified level of activity or 
conditions of optimal activity, e.g., temperature of optimal activity, and the like. 

[01401 Also provided is an isolated nucleic acid comprising the nucleotide sequence 
encoding a MshC polypeptide, MshD polypeptide or MshA polypeptide, fragments thereof 
encoding polypeptides having MshC. MshD or MshA activity, and/or equivalents of such 
nucleic acids. The term nucleic acid as used herein is intended to include such fragments 
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and equivalents. The tenn equivalent is understood to include nucleotide sequences 
encoding functionally equivalent polypeptides or functionally equivalent peptides having an 
activity of a polypeptide such as described herein. Equivalent nucleotide sequences will 
include sequences that differ by one or more nucleotide substitutions, additions or deletions, 
such as alleUc variants; and wUl also include sequences that differ ftom the nucleotide 
sequence encoding the desired enzyme due to the degeneracy of the genetic code. 
Equivalents will also include nucleotide sequences that hybridize under stringent conditions 
(i.e., equivalent to about 20-27° C below the melting temperature of the DNA duplex 
formed in about 1 M salt) to the nucleotide sequence of a cysteine:glucosaminyl inositol 
ligase gene, such as that as set forth in SEQ ID NO: 1 or 3, particularly those segments 
encoding the polypeptides shown in one of SEQ ID NOS: 7, 8, or 9, or conservative 
variations thereof, nucleotide sequences lhat hybridize under stringent conditions to the 
nucleotide sequence of a acetyl-CoA:Cys-GlcN-Ins acetyltransferase gene, such as that set 
forth in SEQ ID NO: 48 or a MshA glycosyltransferase gene, such as that set forth in SEQ 
ID NO: 49. In one embodiment, equivalents will further include nucleic acid sequences 
derived from and evolutionarily related to such nucleotide sequences. 

[01411 The term "isolated" or ♦'purified" as also used herein with respect to nucleic 
acids, such as DNA or RNA, refers to molecules separated from other DN As, or RNAs. 
respectively, that are present in the natural source of the macromolecule. For example, an 
isolated nucleic acid encoding one of the subject MshC. MshD or MshA polypeptides 
includes, in one embodiment, no more than 10 kilobases (kb) of nucleic acid sequence 
which naturally immediately flanks the mshQ mshD or msM gene in genomic DNA, in 
another embodiment no more than 5 kb of such naturally occurring flanking sequences, and 
in still another embodiment less than 1.5 kb of such naturally occurring flanking sequence. 
The tenn isolated or purified as used herein also refers to a nucleic acid or peptide that is 
substantially free of cellular material or culture medium when produced by recombinant 
DNA techniques, or chemical precursors or other chemicals when chemically synthesized. 
Moreover, an "isolated nucleic acid" is meant to include nucleic acid fragments that are not 
naturally occurring as fiagments and would not be found in the natural state. 
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101421 In yet another embodiment, the nucleic acid set forth above encodes a peptide 
having an amino acid sequence as shown SEQ ID NO: 2. The nucleic acids encode a 
peptide havmg cysteine.glucosaminyl inositol Ugase polypeptide activity and being 35% or 
more homologous, in another embodiment. 40% homologous and m still another 
embodiment, 45% homologous with an amino acid sequence as set forth in SEQ ID NO: 2 
(encoded by a nucleic acid sequence as set forth in SEQ ID NO: 1). Nucleic acids that 
encode peptides having an activity of cysteine:glucosaminyl inositol Ugase polypeptide are 
also within the scope of the invention. Further, the nucleic acid of the invention encodes a 
peptide having an amino acid sequence as shown SEQ ID NO: 14 or 15. In stiU another 
embodiment, the invention provides a nucleic acid that encodes a peptide having an amino 
acid sequence as shown SEQ ID NO: 19 or 20. 

[01431 Another aspect of the invention provides a nucldc acid which hybridizes under 
high or low stringency conditions to a nucleic acid which encodes a cysteine:glucosaminyl 
inositol ligase polypeptide having all or a portion of an amino acid sequence shown in one 
of SEQ ID NOS: 2, 4, 7, 8 or 9. Similarly, the invention provides a nucleic acid which 
hybridizes under high or low stringency conditions to a nucleic acid which encodes an 
acetyl-CoA:Cys-GlcN-Ins acetyltransferase polypeptide having all or a portion of an amino 
acid sequence shownin one of SEQ ID NOS: 14or 15. Additionally, the invention 
provides a nucleic acid which hybridizes under high or low stringency conditions to a 
nucleic acid which encodes a MshA glycosyltransferase polypeptide having all or a portion 
of an amino acid sequence shown in one of SEQ ID NOS: 19 or 20. Appropriate stringency 
conditions which promote DNA hybridization, for example, 6.0X sodium chloride/sodium 
citrate (SSC) at about 45- C, followed by a wash of 2.0X SSC at 50" C, are known to those 
sMUed in the art or can be found in Current Protocols in Molecular Biology, John Wiley & 
Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the salt concentration in the wash step can be 
selected from a low stringency of about 2.0X SSC at 50° C. to ahigh stiingency of about 
0.2X SSC at 50° C. In addition, tiie temperature in the wash step can be increased from low 
stringency conditions at room temperature, about 22° C, to high stiingency conditions at 
about 65° C. 
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[01441 Isolated nucleic acids that differ j&om the nucleotide sequences disclosed herein 
due to degeneracy in the genetic code are also within the scope of the invention. For 
example, more than one triplet designates a number of amino acids. Codons that specify the 
same amino acid, or synonyms (for example, CAU and CAC are synonyms for histidine) 
may residt in "silent" mutations that do not affect the amino acid sequence of the protem. 
However, it is expected that DNA sequence polymorphisms that do lead to changes in the 
amino acid sequences of the subject MshC. MshD or MshA polypeptides wiU exist among 
prokaryotic cells. One skilled in the art will appreciate that these variations in one or more 
nucleotides (up to about 3-4% of the nucleotides) of the nucleic acids encoding a particular 
member of the MshC, MshD or MshA polypeptide family may exist among individuals of a 
given species due to natural alleUc variation. Any and all such nucleotide variations and 
resulting amino acid polymorphisms are within the scope of this invention. 

101451 Fragments of the nucleic acid encoding a biologically active portion of the 
subject MshC. MshD or MshA polypeptides are also within the scope of the invention. As 
used herein, a fragment of tiie nucleic acid encoding an active portion of a MshC, MshD or 
MshA polypeptide refers to a nucleotide sequence having fewer nucleotides than the 
nucleotide sequence encoding the full length amino acid sequence of, for example, the 
Ugase polypeptides represented in nucleic acid SEQ ID NO: 1, and which encodes a peptide 
which retains at least a portion of the biological activity of the fiiU-length protein (i.e., a 
peptide capable of cysteine:glucosaminyl inositol Ugase activity) as defined herein, or 
alternatively, which is functional as an antagonist of the activity of the fuU-lengtti protein. 
Nucleic acid fragments within the scope of the invention include those capable of 
hybridizing under high or low stringency conditions with nucleic acids from other species, 
e.g. for use in screening protocols to detect homologs of the subject MshC, MshD or MshA 
polypeptides. Nucleic acids within the scope of the mvention may also contain linker 
sequences, modified restriction endonuclease sites and other sequences useful for molecular 
cloning, expression or purification of such recombinant peptides. 

[01461 As indicated by the examples set out below, a nucleic acid encoding a peptide 
having an activity of a mycothiol biosynthesis Ugase, acetyltransferase or 
glycosyltransferase polypeptide may be obtained from mRNA or genomic DNA present in 
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any of a number of antibiotic-producing or pathogenic bacteria, particularly actinomycetes, 
in accordance with protocols described herein, as weU as those generaUy known to those 
skilled in the art. A cDNA encoding a MshC, MshD or MshA polypeptide, for example, 
can be obtained by isolating total mRNA fiom a bacterial cell. Double stranded cDNAs can 
then be prepared firam the total mRNA, and subsequentiy inserted into a suitable plasmid or 
bacteriophage vector using any one of a number of known techniques. A gene encoding a 
MshC, MshD or MshA polypeptide can also be cloned using estabUshed polymerase chain 
reaction techniques in accordance witii tiie nucleotide sequence information provided by the 
invention. 

[01471 Anottier aspect of the invention relates to the use of an "anti-sense" isolated 
nucleic acid. As used herein, an "anti-sense" inhibition of endogenous production of a 
MshC, MshD or MshA molecule is carried out by administiation or in situ generation of 
oUgonucleotide probes or their derivatives which specifically hybridize bind) under 
intracellular conditions, witii the ceUular mRNA and/or genomic DNA encoding a MshC, 
MshD or MshA polypeptide so as to inhibit expression of that protein or a constitiient 
thereof, e.g. by inhibiting transcription and/or translation. The binding may be by 
conventional base pair complementarity, or, for example, in tiie case of binding to DNA 
duplexes, through specific interactions in the major groove of the double helix. In general, 
"anti-sense" therapy refers to the range of techniques generally employed in the art, and 
includes any therapy tiiat reUes on specific binding to oligonucleotide sequences. 

[01481 An anti-sense construct of the present invention can be delivered, for example, as 
an expression plasmid which, when transcribed in the transformed cell, produces RNA 
which is complementary to at least a unique portion of the cellular mRNA tiiat encodes a 
MshC, MshD or MshA polypeptide. Alternatively, the anti-sense construct is an 
oUgonucleotide probe that is generated ex vivo and which, when introduced into the cell, 
causes inhibition of expression by hybridizing with tiie mRNA and/or genomic sequences 
encoding one of tiie subject MshC, MshD or MshA proteins. Such oUgonucleotide probes 
are preferably modified oUgonucleotides tiiat are resistant to endogenous nucleases, e.g. 
exonucleases and/or endonucleases, and is tiierefore stable in vivo. Exemplary nucleic acid 
molecules for use as anti-sense oUgonucleotides are phosphoramidate. phosphotiiioate and 
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methylphosphonate analogs of DNA (see also U.S. Pat. Nos. 5.176,996; 5,264,564; and 
5,256,775). Additionally, general approaches to constructing oUgomers useful in anti-sense 
techniques have been reviewed, for example, by van der Krol et al. (1988) Biotechniques 
6:958-976; and Stein et al. (1988) Cancer Res 4^:2659-2668. 

10149] In addition, the oligomers of the invention may be used as reagents to detect the 
presence or absence of the target DNA or RNA sequences to which they specifically bind. 
Such diagnostic tests are described in further detail below. 

[0150] This mvention also provides expression vectors comprising a nucleotide sequence 
encoding a member of the femiUes of MshC, MshD or MshA polypeptides and operably 
linked to at least one regulatory sequence. Operably linked is intended to mean that the 
nucleotide sequence is linked to aregulatory sequence in amanner that allows expression of 
the nucleotide sequence. Regulatory sequences are recognized in the art and are selected to 
direct expression of the peptide having an activity of a MshC, MshD or MshA polypeptide. 
Accordingly, the term regulatory sequence includes promoters, enhancers and other 
expression control elements. Exemplary regulatory sequences are described in Goeddel; 
Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego. 
CaUf. (1990). For instance, any of a wide variety of expression control sequences, 
sequences that control the expression of a DNA sequence when operatively Imked to it, may 
be used in these vectors to express DNA sequences encoding mycothiol biosynthesis 
cysteine:glucosaminyl inositol ligase polypeptides. Such useful expression control 
sequences, include, for example, the early and late promoters of S V40, adenovirus or 
cytomegalovirus immediate early promoter, the lac system, the trp system, the TAG or TRC 
system, T7 promoter whose expression is directed by T7 RNA polymerase, the major 
operator and promoter regions of phage lambda, the control regions for fd coat protein, the 
promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid 
phosphatase, e.g., Pho5, the promoters of the yeast a-mating factors, the polyhedron 
promoter of the baculovirus system and other sequences known to control the expression of 
genes of prokaryotic or eukaryotic cells or their viruses, and various combinations thereof! 
It should be understood that the design of the expression vector may depend on such fectors 
as the choice of the host cell to be transformed and/or the type of protein desired to be 
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expressed. Moreover, the vector's copy number, the abiUty to control that copy number and 
the expression of any other proteins encoded by the vector, such as antibiotic markers, 
should also be considered. 

[01511 As will be apparent, the subject gene constructs can be used to cause expression 
of the subject MshC, MshD or MshA polypeptides in cells propagated in culture, e.g. to 
produce proteins or peptides, including fusion proteins or peptides, for purification. 

[01521 This invention also pertains to a host cell transfected with a recombinant mshC, 
mshD or mshA gene in order to express a polypeptide having an activity of a MshC, MshD 
or MshA polypeptide. The host cell may be any prokaryotic or eukaryotic cell. For example, 
a MshC, MshD or MshA polypeptide may be expressed in bacterial cells such as E. coli, 
insect cells (baculovirus), yeast, or mammalian cells. Other suitable host cells are known to 
those skilled in the art. 

[01531 Another aspect of the present invention concerns recombinant MshC, MshD or 
MshA polypeptides that have the Ugase activity of a cysteine: glucosaminyl inositol Ugase 
polypeptide, acetyltransferase activity of an acetyl-CoA:Cys-GlcN-Ins acetyltransferase 
polypeptide or glycosyltransferase activity of a MshA glycosyltransferase polypeptide, or 
which are naturally occurring mutants thereof. The tenn "recombinant protein" refers to a 
protein of the present invention that is produced by recombinant DNA techniques, wherein 
generally DNA encoding the MshC polypeptide, MshD polypeptide or MshA polypeptide is 
inserted into a suitable expression vector which is in turn used to transform a host cell to 
produce the heterologous protein. Moreover, the phrase "derived ftom," with respect to a 
recombinant gene encoding the recombinant MshC polypeptide. MshD polypeptide or 
MshA polypeptide, is meant to include within the meaning of "recombinant protein" those 
proteins having an amino acid sequence of a native MshC polypeptide, MshD polypeptide 
or MshA polypeptide, or an amino acid sequence similar thereto which is generated by 
mutations including substitutions and deletions of a naturally occurring MshC polypeptide, 
MshD polypeptide or MshA polypeptide of a organism. 

[01541 The present invention further pertains to methods of producing the subject MshC, 
MshD or MshA polypeptides. For example, a host cell transfected with expression vector 
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encoding one of the subject MshC, MshD or MshA polypeptides can be cultured under 
appropriate conditions to allow expression of the peptide to occur. The peptide may be 
secreted and isolated ftom a mixture of cells and medium containing the peptide. 
Alternatively, the peptide may be retained cytoplasmically and the cells harvested, lysed 
and the protein isolated. A cell culture includes host cells, media and other byproducts. 
Suitable media for ceU culture are well known in the art The peptide can be isolated from 
cell culture medium, host cells, or both using techniques known in the art for purifying 
proteins including ion-exchange chromatography, gel filtration chromatography, 
ultrafiltration, electrophoresis, and immunoaffinity purification with antibodies, such as 
invention antibodies, specific for particular epitopes of the subject MshC, MshD or MshA 
polypeptides. 

[0155] Thus, a nucleotide sequence derived fix>m the cloning of a MshC, MshD or MshA 
polypeptide of the present invention, encoding all or a selected portion of the protein, can be 
used to produce a recombinant form of the protein via microbial cellular processes. 

10156) The recombinant MshC. MshD or MshA polypeptide can be produced by Ugating 
the cloned gene, or a portion thereof, into a vector suitable for expression in bacterial cells. 
Expression vehicles for production of a recombinant cysteine-.glucosaminyl inositol Ugase 
polypeptide include plasmids and other vectors. For instance, suitable vectors include 
plasmids of the types: pBR322-derived plasmids. pEMBL-derived plasmids, pEX-derived 
plasmids, pBTac-derived plasmids and pUC-derived plasmids for expression in prokaryotic 
cells, such as E. coli. 

[0157] A number of vectors exist for the expression of recombinant proteins in yeast. 
For instance, YEP24, YIPS, YEP51. YEP52. pYES2, and YRP17 are cloning and 
expression vehicles usefiil in the introduction of genetic constructs into S. cerevisiae (see, 
for example. Broach et al. (1983) in Experimental Manipulation of Gene Expression, ed. M. 
Inouye Academic Press, p. 83. incorporated by reference herein). These vectors can 
repUcate in E. coli due the presence of the pBR322 ori. and in S. cerevisiae due to the 
• repUcation detemiinant of the yeast 2 micron plasmid. hi addition, drug resistance markers 
such as ampicillin can be used. 
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[0158] The various methods employed in the preparation of the plasmids and 
transformation of host organisms are well known in the art. For other suitable expression 
systems for both prokaryotic and eukaryotic cells, as well as general recombinant 
procedures, see Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, 
Fritsch and Maniatis (Cold Spring Harbor Laboratory Press:1989) Chapters 16 and 17. In 
some instances, it may be desirable to express the recombinant MshC, MshD or MshA 
polypeptide by the use of a baculovirus expression system. Examples of such baculovirus 
expression systems include pVL-derived vectors (such as pVL1392, pVL1393 and 
pVL941). pAcUW-derived vectors (such as pAcUWl), and pBlueBac-derived vectors (such 
as the P-gal containing pBlueBac HI). 

[0159] This invention further contemplates amethod of generating sets of combinatorial 
mutants of the present MshC, MshD or MshA polypeptides, as well as truncation mutants, 
and is especially useful for identifying potential variant sequences (e.g. homologs) that are 
functional in ligation of cysteine to glucosaminyl inositol or a derivative thereof, functional 
in acetylation of Cys-GlcN-Ins by acetyl-CoA or functional in formation of GlcNAc-Ins. In 
a representative embodiment of this method, the amino acid sequences for a population of 
MshC, MshD or MshA polypeptide homologs are aligned. One effect of such alignment is 
to promote the highest homology possible. Such a population of variants can include, for 
example, homologs from one or more species, or homologs from the same species but 
which differ due to mutation. Amino acids that appear at each position of the aligned 
sequences are selected to create a degenerate set of combinatorial sequences. The presence 
or absence of amino acids from an aUgned sequence of a particular variant is relative to a 
chosen consensus length of a reference sequence, which can be real or artificial. In order to 
maintain the highest homology in aUgnment of sequences, deletions in the sequence of a 
variant relative to the reference sequence can be represented by an amino acid space (*), 
while insertional mutations in the variant relative to the reference sequence can be 
disregarded and left out of the sequence of the variant when aligned. 

[0160] Further racpansion of the combinatorial library can be made by, for exanq>le, by 
including amino acids that would represent conservative mutations at one or more of the 
degenerate positions. Inclusion of such conservative mutations can give rise to a library of 
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potential MshC, MshD or MshA sequences. Alternatively, amino acid replacement at 
degenerate positions can be based on steric criteria, e.g. isosteric replacement, without 
regard for polarity or charge of amino acid side chains. Similarly, completely random 
mutagenesis of one or more of the variant positions can be carried out. 

[0161] In one embodiment, the combinatorial MshC, MshD or MshA Ubrary is produced 
by way of a degenerate Ubrary of genes encoding a Ubrary of polypeptides that each include 
at least a portion of potential MshC, MshD or MshA polypeptide sequences. For instance, a 
mixture of synthetic oUgonucleotides can be enzymatically Ugated into gene sequences such 
that the degenerate set of potential MshC, MshD or MshA nucleotide sequences are 
expressible as individual polypqptides, or alternatively, as a set of larger fusion proteins 
{e.g. for phage display) containing the set of MshC, MshD or MshA polypeptide sequences 
therein. 

101621 There are many ways by which the Ubrary of potential MshC, MshD or MshA 
homologs can be generated fiom a degenerate oUgonucleotide sequence. Chemical 
synthesis of a degenerate gene sequence can be carried out in an automatic DNA 
synthesizer, and the synthetic genes then be Ugated into an appropriate gene for expression. 
The purpose of a degenerate set of genes is to provide, in one mixture, all of the sequences 
encoding the desired set of potential MshC, MshD or MshA sequences. The synthesis of 
degenerate oUgonucleotides is well known in the art (see for example, Narang, SA (1983) 
Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. 
Macromolecules, ed. A G Walton, Amsterdam: Elsevier pp. 273-289; Itakura et al. (1984) 
Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) 
Nucleic Acid Res. 1 1 'All. 

[01631 A wide range of techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations, and, for that matter, for screening cDNA 
Ubraries for gene products having a certain property. Such techniques will be generaUy 
adaptable for rapid screening of the gene Ubraries generated by the combinatorial 
mutagenesis of MshC, MshD or MshA homologs. The most widely used techniques for 
screening large gene Ubraries typicaUy comprises cloning the gene Ubrary into repUcable 
expression vectors, transforming appropriate ceUs with the resulting Ubrary of vectors, and 
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expressing the combinatorial genes under conditions in which detection of a desired activity 
facilitates relatively easy isolation of the vector encoding the gene product was detected. 
Each of the illustrative assays described below are amenable to high throughput analysis as 
necessary to screen large numbers of degenerate sequences created by combinatorial 
mutagenesis techniques. 

[01641 The invention also provides for reduction of the subject MshC, MshD or MshA 
polypeptides to generate mimetics, e.g. peptide or non-peptide agents, which are able to 
mimic binding of the authentic cysteine-.glucosaminyl inositol Ugase polypeptide, acetyl- 
CoA:Cys-GlcN-Ins acetyltransferase polypeptide or MshA glycosyltransferase polypeptide 
to a substrate molecule. Such mutagenic techniques as described above, as weU as the 
thioredoxin system, are also particularly useful for mapping the determinants of a MshC, 
MshD or MshA polypeptide which participate in protein-protein interactions involved in, 
for example, binding of the subject cysteme.glucosaminyl inositol Ugase polypeptide. 
acetyl-CoA:Cys-GlcN-Ins acetyltransferase polypeptide or MshA glycosyltransferase 
polypeptide to a substrate. To illustrate, the critical residues of a subject, for example. 
cysteine:glucosaminyl inositol Ugase polypeptides which are involved in molecular 
recognition of substrate can be determined and used to generate cysteine:glucosaminyl 
inositol Ugase-derived peptidomimetics that catalyticaUy Unk cysteine or a cysteine 
derivative to glucosaminyl inositol or other derivative of glucosaminyl inositol. By 
employing, for example, scanning mutagenesis to map the amino acid residues of a 
particular cysteine-.glucosaminyl inositol Ugase polypeptide involved in binding to a 
substrate, peptidomimetic compounds (e.g. diazepine or isoquinoUne derivatives) can be 
generated which mimic those residues that Unk cysteine. For instance, non-hydrolyzable 
peptide analogs of such residues can be generated using benzodiazepine (e.g.. see Freidinger 
et al. in Peptides: Chemistry and Biology, G. R. MarshaU ed., ESCOM PubUsher: Leiden. 
Netherlands. 1988), azepine {eg., see Huf&nan et al. in Peptides: Chemistry and Biology, 
G. R. Marshall ed.. ESCOM PubUsher: Leiden, Netherlands, 1988), substituted gama lactam 
rings (Garvey et al. in Peptides: Chemistry and Biology, G. R. Marshall ed., ESCOM 
PubUsher: Leiden, Netherlands, 1988). keto-methylene pseudopeptides (Ewenson et al. 
(1986) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure and Function 
(Proceedings of the 9th American Peptide Synq)osium) Pierce Chemical Co. Rockland, Dl.. 
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1985), beta-tum dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et 
al. (1986) J Chem Soc Perkin Trans 1:1231), and p-aminoalcohols (Gordon et al. (1985) 
Biochem Biophys Res Commun 126:419; and Dann et al. (1986) Biochem Biophys Res 
Commvm 134:71). Similar methods are applicable to acetyl-CoA:Cys-GlcN-Ins 
acetyltransferase polypeptides and MshA glycosyltransferase polypeptides. 

(01651 Purification of the Cys:GlcN-]ns Ugase (SEQ ID NO: 2) from M. smegmatis 
proved difficult and many preliminary studies were needed to finally obtain sufficiently 
pure protein for N-tenninal sequencing. The major problems encountered were multiple 
interconverting forms of the enzyme activity which eluted separately on chromatography 
and instabiUty of the enzyme activity in purification buffers. The method which was 
eventually successful depended upon selection of &e correct peak of activity at each stage 
of purification and intensive effort to complete the protocol before activity was lost 

101661 The first purification step (Table 1) involved a 15-50% ammonium sulfate 
fi^ctionation and this resulted in a 6.5-fold increase in specific activity with very Uttle loss 
of protein. After desalting on G-25 this material was chromatographed on DEAE 650-M. 
The elution profile (Figure 3 A) revealed three major peaks of activity, the major portion of 
the activity residing in the first and third peaks. Experience revealed that the third peak was 
more readily susceptible to further purification and the central firactions from this peak were 
combined for chromatography on hydroxyl-apatite (Figure 3B). Activity eluted primarily in 
one broad peak. The most active firactions were combined and bound to a Reactive Brown- 
10 dye affinity column at pH 6.5. After washing off unbound protein the column was eluted 
by increasing the pH to 8.0 over 5 column volumes (Figure 3C). The activity eluted at pH 8 
in a sharp but tailing peak. 

[01671 The purification was monitored by SDS-PAGB and two closely migrating 
proteins of Mr ~ 47000 were detected. The smaller protein was submitted for N-tenninal 
amino acid sequencing as described in Example 4 and the sequence 
MQSWSAPAIPWPGRGPALR (SEQ ID NO: 7) was obtained for the N-terminal 20 
amino acids. When the smaller protein of SEQ ID NO: 7 was BLAST searched against the 
TIGR website of unfinished M smegmatis genomic sequence, a 100% identity match was 
obtained with the N-tenninal sequence for an open reading fismae found on contig 3267 
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(corresponding to nucleic acids 1222384-1223620) and encoding a protein of 412 amino 
acid residues. A BLAST search of the N-tenninal sequence (SEQ ID NO: 7) against the M. 
tuberculosis genome database (S.T. Cole et al. (1998) Nature 393, 537-544) on TubercuUst 
data base retrieved the sequence assigned as CysS2 (Rv2130c) (SEQ ID NO: 4) as the only 
one with significant N-terminal sequence correspondence and revealed a 70% identity and 
80% positive homology between Ihe N-terminal 20 amino acid residues of MshC of M 
smegmatis (SEQ ID NO: 7) and those of M tuberculosis. The full sequences for the two 
proteins are compared in Figure 4. 

[0168] Sequencing of the larger protein produced the sequence 

(G/S/M)(E/Q)HLKVDAMQSW(S/DP)APAIP (SEQ ID NO: 8). which overlaps that of the 
smaller protein. Comparison of Ihe M smegmatis upstream from the terminal Met (SEQ ID 
NO: 1) 6f the smaller protein detected a matching sequaice SEHLKVDAMQSWSAPAIP 
(SEQ ID NO: 9) However, a search of Tuberculist database for a sequence matching SEQ 
ID NO: 9 found no match other than M. tuberculosis cysS2 and the upstream residues of this 
gene are not homologous to those of the M smegmatis mshC gene. Thus, the Ugase of M 
smegmatis may be translated in two forms, one having an N-terminal extension of 8 
residues starting with Ser and the other having Met as the N-terminal residue. Alternatively 
the larger protein may have been translated and partially processed by proteases. 

[0169] Active enzyme fractions were utilized to estimate and values for Cys and 
GlcN-Ins from Eadie-Hofstee plots. The invention Ugase was found to have values of 
40 ± 3 and 72 ± 9 /M for Cys and GlcN-his, respectively. 

[0170] Confirmation that cysS2 of M tuberculosis codes for MshC was obtained by 
using PCR to clone the gene into pRSETA. Sequencmg of the cloned DNA verified that the 
cloning was accurate. The Hiss-tagged protein was expressed in E. coli after induction with 
isopropyl-p-D-thiogalactopyranoside. Assay of the crude extract with 1 mM ATP, 0.1 mM 
Cys, and 50 pM GlcN-Ins gave 0.12 nmol min"* mg-^ of Ugase activity whereas analogous 
measurements on E. coli transformed with the blank vector yielded no measurable Ugase 
activity (<0.01 nmol min'* mg*^). 
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101711 In another embodiment, the present invention provides mutants of mycothiol- 
producing bacteria that are constructed to be deficient in MSH production. One chemical 
mutant (164) and two transposon mutants (Tnl and Tn2) of M. smegmatis stably deficient in 
MSH production were isolated by screening for reduced levels of MSH content. The MSH 
content of transposon mutants Tnl and Tn2 was found to be less than 0.1% that of the wild 
type strain mcMsS and the MSH content of chemical mutant 164 was found to be 1% of the 
wild type strain. All three strains accumulated GlcN-lns at a level 20-25-fold higher than 
the level found in the wild type parent strain. The L-cysteine: iD-myo-inosityl 2-amino-2- 
deoxy-a-D-glucopyranoside Ugase (MshC) activity was <2% that of the parent strain for the 
three mutant strains. 

[0172] Members of the actinomycetes family, including the genus mycobacterium, 
exhibit innate resistance to many currently available antimicrobial drugs. This phenomenon 
is generally ascribed to their cell wall impermeabiUty. In Examples 6-9 below, it is shown 
that MSH deficient mutants in which the mshC gene has been disrupted have increased 
sensitivity to a wide variety of agents including antibiotics, oxidizing agents and alkylating 
agents. Phenotypic analysis of the invention mutants revealed that these MSH deficient 
mutants possess increased sensitivity to firee radicals and alkylating agents and to a wide 
range of antibiotics including eryfliiomycin, azithromycin, vancomycin, penicillin G, 
rifemycin and rifampin. Conversely, the mutants possess at least 200-fold more resistance to 
isoniazid compared to wild type. 

[0173] The demonstration that mycothiol depleted bacteria are sensitive to antibiotics is 
supported by earlier identification of the mycothiol-dependent detoxification system (G.L. 
Newton et al. (2000a). supra). The production of mycothiol in wild type species may 
account for tiie natural abiUty of actinomycetes and especially mycobacteria to resist a wide 
range of antibiotics. 

[0174] It might be argued that one of the precursors to MSH is needed in an as yet 
unidentified process to confer resistance to peroxide (Figure 6). However, the present 
studies show tiwt mutants blocked at production of MshC still produce both GlcNAc-Ins 
and GlcN-Ins but are nevertheless higihly sensitive to hydrogen peroxide. Thus, tiie 
possibiUty that these intermediates are associated with peroxide resistance is eliminated. 
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This leaves the possibiUty that Cys-GlcN-Ins, rather than MSH, might be the key to 
peroxide resistance. However. Cys-GlcN-Ins is present in wild type M. smegmatis at almost 
undetectable levels (Anderberg. et al. (1998) J. Biol. Chem. 273:30391-30397). which 
makes it unlikely that Cys-GlcN-Ins could play a significant role in peroxide resistance. 
Thus. MSH, rather than one of its intennediates. remains as the key to antibiotic and 
peroxide sensitivity. Accordingly, the invention MSH minus mutants are generally more 
sensitive to peroxide and many antibiotics. 

[0175] Although having increased sensitivity to various antibiotics, the MSH-deficient 
mutants of M. smegmatis described herein are extremely resistant to isoniazid. an 
antimycobacterial prodrug that needs to be activated inside mycobacteria to form a reactive 
intermediate of yet undetermined structure (JL Johnsson et al.(1991) J. Biol. Caiem. 
272:2834-2840). Since the MSH mutants have other low molecular weight thiols such as 
cysteine, and coenzyme A in their reduced fonn (data not shown), these mutants cannot be 
described as free of low molecular weight thiols. Thus, the INH resistance is probably 
directly derived from lack of mycothiol, and not just a lack of low molecular weight thiol 
reductants in general. Thus, the results of the studies presented herein confirm and expand 
the correlation between mycothiol depletion and sensitivity of members of the 
actinomycetes family, including the genus mycobacterium to toxins and antibiotics, thus 
supporting the notion that inhibition of MSH metaboUsm, in particular inhibition of MshC, 
can be an attractive drug target for novel inhibitory confounds. In addition, an inhibitor of 
MshC activity in such bacteria can be used in combination with antibiotics, including those 
not currentty in use against mycobacteria, in treatment of diseases that are associated wiih 
mycobacteria, such as Rifampin, rifamycin, erythromycin, azithromycin, vancomycin, arid 
the like. 

10176] The isolation of MshD included an initial screening for isoniazid resistance in the 
production of a mutant Ubrary enriched in MSH-deficient mutants, as set forth below in 
Example 10. Thelibrary contained 3 mutants with <1% MSH per 200 tested. InanearUer 
study using chemical mutagenesis with AT-methyl-JV-nitro-iV-nittosoguamdine and 
screening for slow growth on plates containing diamide one mutant was found to be devoid 
of MSH (mutant 49) in 415 tested (Newton, G.L., Unson, M., Anderberg, S., Aguilera. J.A., 
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Oh, N.N., delCardayre, S., Davies, J., Av-Gay, Y., Fahey, R.C. (1999) Biochem. Biophys. 
Res. Comm. 255:239-244.). Thus, the piesent method appears to produce a 6-fold higher 
MSH-deficient mutant density which makes identification of actual MSH-deficient mutants 
by HPLC analysis more feasible. 

[0177] The mshD: :Tn5 mutant is usefiil for exploring various phenomena associated 
with the function of MSH. These include the exploration of the role of MSH in maintaining 
the cellular reducing environment in conjunction with mycothiol disulfide reductase (Patel, 
M.P., Blanchard, J.S. (1999) Biochem. 38:11827-11833.) and in the detoxification of 
alkylating agents and other cellular poisons (Misset-Smits, M., van Ophem, P.W., Sakuda, 
S., Duine, J.A. (1997) FEES Lett 409:221-222; Newton, G.L., Av-Gay. Y., Fahey. R.C. 
(2000) Biochem. 35:10739-10746; Norin, A, Van Ophem, P.W., Piersma. S.R., Persson, B., 
Duine, J.A.. Jomvall, H. (1997) Eur, J. Biochem. 248:282-289.). Since mshDi.TnS 
accumulates all of the intermediates in the MSH biosynthetic pathway but does not produce 
MSH, comparison of the phenotype of this mutant with those of mutants blocked in other 
steps of MSH biosynthesis allows an assessment of whether or not the intermediates m 
MSH biosynthesis have other functions. But very key uses of mshD::TQ5 have already been 
demonstrated below in Example 10, as providing a source of Cys-GlcN-Ins and as used in 
identification of the mshD gene. 

[0178] The role ofmycothiol in the activation ofihe prodrug isoniazid in MsmegTwa/w 
was also examined. Mycothiol mutants in the mshA gene (Newton, G.L., Av-Gay, Y., 
Fahey. R.C. (2000) J. Bacterial. 182:6958-6963; Newton, et al, 1999. supra.) , the mshC 
gene (Rawat, M., Newton, G.L.. Ko, M., Martinez, G.J., Fahey. R.C., Av-Gay, Y. (2002) 
Antimicrob. Agents Chemother., in press.) and the mshD gene are all highly resistant to 
isoniazid. The liansposon mutant mshD: :Tn5 presumably must, Uke the wild-type strain, 
have intact isoniazid target genes and it produces substantial levels of a thiol, Cys-GlcN-Ins, 
having very similar structure to mycothiol. MSH is therefore specifically required, either 
directiy or indirectly, for activation of isoniazid. This result is consistent with the earUer 
finding that restoring MSH content &om 0% to 31% by complementation of chemical 
mutant 49 markedly reduced its isoniazid resistance (Newton, et a/., 1999, supra). 
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10179] The MshD acetyltransferase belongs to the large GCN5-related N- 
acetyltransferase famUy ( Dyda, F., Kldn, D.C., Hickman, A.B. (2000) ^««u. Rev. Biophys. 
Biomol Struct 29:81-103) but has some unusual characteristics. The sequence includes two 
separate regions with similarity to the pfam00583 domain characterizing acetyltransferases, 
with the C-teraunal re^on having the highest degree of similarity (Figure 8). Aside from 
the orthologs found in other actinomycetes the next most closely related proteins found in a 
GenBank search using Rv0819 are acetyltransferases which acetylate an N-temiinal Ala 
residue of ribosomal proteins, such as the E. coli protein derived from the rzm/ gene 
(Accession Number AAA97269). These proteins are half or less the length of MshD and a 
sequence match with higher sequence similarity occurs with the second acetyltransferase 
domain. These features suggest that the mshD gene is the product of a gene dupUcation and 
that it is the C-terminal domain that is involved in acetyl-CoA binding. 

[01801 The mshB, mshC and mshD genes are widely separated in the M. tuberculosis 
genome and do not form a recognizable biosynthetic operon. It should be noted that 
Rv0818. a gene upstream to mshD (Rv0819). is a putative regulatory gene. In another 
complete actinomycete genome. S. coelicolor A3(2) (Sanger Institute, available on the 
worldwide web at sanger.ac.uk/Projects/S_coeUcolor), the mshB, mshC, and mshD genes 
are identified by homology to those of M. tuberculosis as SC9E12.1 1 (Accession no. 
CAC05756). SCI52.05C (Accessionno. CAC36366), and SCD84.18c (Accession no. 
CAB88484), respectively. These genes are also widely dispersed throughout the genome. 

[0181] The identification of the mycothiol biosynthesis genes establishes the basis for 
production of MSH biosynthesis gene knockouts in M. tuberculosis. Such knockouts can be 
used in determining the role of MSH in the virulence of M tuberculosis. 

[0182] As estabUshed in Example 1 1, the mshA gene in M. smegmatis is essential for the 
production ofGlcNAc-Ins and thereby for the synthesis of MSH. TheM tuberculosis 
mshA gene, Rv0486, has been previously assigned by Campbell, et al., Biochem. J. 
326:929-39, 1997 (who Usted it as unknown protein MTCY20G9.12) as a member of 
glycosyltransferase family 4 in the CAZy database (displayed on the world wide web at 
http://afinb.cnrs-mrs.fr/CAZY/). This femily includes a number of sucrose synthases, 
sucrose phosphate synthases (Sps), mannosyl transferases, and GlcNAc transferases. 
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Homologs for MshA were found in sucrose and sucrose phosphate synthases and with the 
mannosyl transferase PimB (Figure 11). Figure 11 includes sequences for SpsA from 
Anabaena sp. PCC7120 (SEQ ID NO: 21) and PimB (Rv0557) from M. tuberculosis (SEQ 
ID NO: 22), with GenBank accession numhers AJ302071 and NP_215071. respectively. 
SpsA has 29% overall sequence identity withM. tuberculosis MshA whereas PimB has 
24% identity. Cumino. et al.. FEBS Lett. 517:19-23, 2002, have identified two highly 
conserved motifs present in the glucosyltransferase domains of sucrose-phosphate synthases 
and sucrose synthases, designated Box I and Box H as shown in Figure 1 1. The M. 
tuberculosis MshA is identical to SpsA in 6 out of 12 Box I residues and in 17 out of 25 
BoxHresidues. The first Gly of Box I is changed to Asp in M ^ymegTnari* mutant 49. This 
results in complete loss of the abihty to synthesize GlcNAcIns, GlcN-Ins and MSH in vivo, 
showing that this region is critical to the activity of MshA. The Box n domain contains the 
E-X7-E signature (Geremia, et al., Biochem. J. 318:133-8 (1996).) universally found in the 
retaining glycosyl transferases of CAZy femily 4 and for which the first residue, E^" (M. 
tuberculosis numbering), has been shown to be essential for activity of the mannosyl 
transferee AceA from Acetobacter xylinum. (Abdian, P.L.. et al., J. Biol. Chem. 
275:40568-75 (2000).) PimB is less simUar to SpsA than MshA but there are some 
additional residues identified by Kremer, et al. (Biochem. J. 363:437-47, 2002) as highly 
conserved in mannosyl transferases that are conserved in MshA and SpsA. These include 
Lys"^ which has also been shown to be essential for activity in AceA (Abdian, supra), as 
well as Gly^^', Asp^'*^ and Arg^'^ for which no function has as yet been estabUshed (BCremer, 
supra.). The homology of MshA with known CAZy family 4 glycosyltransferases clearly 
indicates that MshA belongs to this group and is the glycosyltransferase required for 
biosynthesis of the pseudodisaccharide GlcNAc-Ins (Figure 1). 

(01831 M smegmatis mutants deficient in MshC exhibit enhanced sensitivity to 
hydrogen peroxide, redox cycling agents, and certain antibiotics. As such, MshC is a usefiil 
target for new drugs directed against mycobacteria. As set forth below in Example 12. a 
further examination of the M tuberculosis MshC hgase was performed by cloning, 
expressing, purifying, and characterizing the native form of the enzyme. Although active in 
the Cys:GlcN-Ins Hgase assay, it was found to be inactive in a cysteinyl-tRNA Ugase assay. 
The stoichiometry of the reaction catalyzed by the hgase was estabUshed experimentaUy 
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(Figure 17) and a preliminary examination was made of the enzyme kinetics, including a 
demonstration that the enzyme is not feedback inhibited by the product of the Ugase 
reaction, Cys-GlcNylns or by MSH at their physiologic concentiations. 

[0184] Mycobacterial proteins, when expressed in E. coli, often have solubility 
problems. MshC Ugase is the largest protein among all the MSH biosynthetic enzymes 
identified to date. It consists of 414 residues and tends to aggregate as inclusion bodies 
when expressed as a N-tenninal hexahistidyl tagged protem. The formation of AMP from 
ATP, along witii Cys-GlcN-Ins (Figure 18), is consistent with the view that the MshC 
enzyme action is a two step process in which the first step involves rapid activation of Cys 
to the Cys-adenylate (Cys-AMP) by the same mechanism as tiiat catalyzed by cysteinyl- 
tiEiNA synthetase (Sareen, et al., 2002, infira). 

[0185] As set forth in Example 12, in the MshC catalyzed reaction of GlcN-Ins to Cys- 
GlcN-Ins, tiie formation of AMP also includes the formation of pyrophosphate (PPi) (see 
Figure 1). Therefore, it is another embodiment of tiie invention to assay for the presence of 
PPi as a method of determining an effect of a compound on MshC. For example, in one 
embodiment, as set forth above, an inhibitor of MshC can be identified by contacting a 
candidate compound with a cystdne:glucosaminyl inositol ligase in the presence of a 
cysteine, a glucosaminyl inositol or a derivative thereof and ATP, under suitable conditions, 
and assaying for the generation of PPj. Because PPj is a byproduct of the MshC catalyzed 
reaction, a substantial absence of PPi is indicative of a candidate compound that inhibits 
activity of the Ugase. Such an assay for detection of the presence, creation or generation of 
PPi can include such assay techniques as fluoroscopy. Otiier assay techniques are weU 
known to those of skiU in the art. {See, for example, Upson et al. (1996) Anal Biochem. 
243:41-45; Nyren et al. (1994) Anal. Biochem. 220:39-45.) 

[01861 ■ In Example 12, the expression of the soluble protein was successfully achieved 
by cloning mshC in pACE expression vector witii M smegmatis as the host. The 
pACEmshC could also complement Af. smegmatis mycothiol mutant 164. which is deficient 
in MshC Ugase activity. The recombinant MshC Ugase was purified 17.8-fold to 
homogeneity and characterized in some detail. The protein is an active monomer of 34 kD, 
containing 0.7 mol of Zinc per mol of enzyme with K« values for L-Cys and GlcN-Ins to be 
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70 ± 15 pM and 280 ± 43 pM, respectively. The results of Example 12 show that Cys- 
GlcN-Ihs and AMP are produced in 1 :1 stoichiometric ratio as the products of the Ugase 
reaction. Nether the direct product, Cys-GlcN-Ins, nor the ultimate product, MSH, were 
found to feedback inhibit the MshC Ugase enzyme. Additionally, MshC was confirmed to 
have AlP-dependent Cys:GlcN-Ins Ugase activity only. The MshC Ugase recognizes E. coli 
tRNA-^y^ as its substrate with less than 2% the efficiency of the E. coli Cys-tRNA synthetase. 

[0187] The multiple sequence aUgnment comparison of M. tuberculosis MshC and CysS 
(CysRS) with E, coli CysS in the Ught of recently solved crystal structure ofE. coli CysRS, 
reveak that the active site of MshC also seems to be built &om the canonical Rossman fold 
domain, as seen in other Class I tRNA synthetases. This domain is divided into two halves, 
37-146 and 232-279 residues inM tuberculosis MshC (which corresponds to residues 22- 
131 and 208-254 in E. coli CysS). The Rossman fold domain is mterrupted by the 
connective polypeptide (CP) domain (147-231 in MshC. corresponding to 132-207 inR 
coli CysS). Out of the first 57 residues of the CysRS CP domain, only 12 residues are 
conserved in MshC, while out of the last 19 residues of the CP domain, 14 residues are 
conserved, with this stretch highly conserved among MshC orthologs in M leprae, S. 
coelicoior, C. striatum and Thermobifidafusca. MshC ortholog in Thermobifidajusca is 
53% identical to that of M tuberculosis found in 3.6 Mb sequence in TIGR unfinished 
microbial database. Interestingly. the£. coli CysRS CP domain (158-174 residues; 
corresponding to 178-194 of M tuberculosis MshC) the region where the most important 
nucleotide of tRNA*^. the discriminator base U73 interacts during aminoacylation (2i. 22) 
is not conserved at all in MshC. Also, the C-terminal disordered anticodon binding domain 
(residues 403-to 461 in E. coli CysRS) is absent in MshC protein, which is thought to get 
ordered in 4 antiparallel heUces upon tRNA*^ binding, in the ribbon model of CysRS. 

[01881 The crystal structure of ^. coli CysRS also reveals that Zn^* plays an important 
role in the amino acid discrimination in fevor of cystdne and against serine or alanine, 
without any editing function by the enzyme (Newberry, et al. (2002) EMBO J. 21, 2778- 
2787.). Zn^* has been proposed to accurately position the a-carboxylate of the substrate 
cysteine for the nucleophiUc attack on ATP to form cys-adenylate (Cys-AMP). The amino 
acid residues proposed to be involved in the cysteine bmding step (Cys28. Cys209. ffis234) 
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are conserved in the MshC ligase and also in the M tuberculosis CysRS gene (CysS) as 
weU as in aU orthologs of MshC identified in olher species (Figure 18). But, Zn^* has not 
been shown to be required for the catalytic action of E. coli or M. tuberculosis CysRS. 

[0189] A directed knockout in M. tuberculosis of mshB showed surprising weakness to 
oxidants and antibiotics even though it produced at least 20% of the parent level of MSH 
(Buchmeier, et al., (2003) Mol Microbiol. 47:1723-1732.). The compensating deacetylase 
activity in this mshB knockout inM tuberculosis was proposed to be the mycothiol S- 
conjugate amidase Rvl082, a homolog of mshB (Rvll70). M smegmatis mutants in mshA 
(Newton, et al. (2003), submitted) and mshC (Rawat, et al.. (2002). Antimicrob. Agents 
Chemother. 46:3348-3355) have been shown to be devoid of mycothiol and thus these 
genes are the best choices for testing the essentiahty of mycothiol in M. tuberculosis. Jn 
studies with M. smegmatis we have found mycothiol mutants with <5% mycothiol in mshA, 
mshC and mshD. Example 13 below shows that mshC and mycothiol are indeed essential 
for M tuberculosis Erdman, and mshC is provided as a potential drug target. The 
essentiaUty of mycothiol inM. tuberculosis Erdman contrasts with the requirement of 
mycotiiiol in M smegmatis. In M smegmatis complete mycothiol mutants have been 
isolated which indicates that this organism is much less sensitive than M tuberculosis to 
oxidants encountered during normal metabolism in laboratory culture. Mycothiol appears 
important for the antioxidant defenses of M tuberculosis in laboratory culture and may be 
even more critical to the organism during times of oxidative stress such as intracellular 
colonization of the pulmonary macrophage. 

[0190] A knockout of mshC was attempted by allelic exchange via efficient 
mycobacteriophage transduction (Bardarov, et al. (2002), Microbiol, 148:3007-17) which 
gave no surviving transformants with the mshC gene altered. All hygromycin resistant 
transformants examined had parental mycothiol levels and were deemed to be spontaneous 
hygromycin mutants. One interpretation of these results is that mshC or some gene 
downstream of mshC may be essential. 

[0191] The following examples are intended to illustrate but not Umit the invention. 
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EXAMPLE 1 

roENTmCATION OF MshC 

[0192] Bacterial culture. M. smegmatis strain mc^ 1 55 was grown in Middlebrook 7H9 
broth (Difco Laboratories) supplemented with 0.05% TWEEN™ 80 (Fisher) and 0.4% 
glucose (Fisher) at 37 °C and 250 rpm. After 28 h of cultivation, the bacterial cells were 
collected by centrifugation at 8000 g for 15 min. The ceU pellets were stored at -70 "C until 
used. 

[0193] Reagents. MSH was isolated from M smegmatis and derivatized with 
monobromobimane (mBBr, Molecular probes) to form the monobromobimane derivative of 
mycothiol (MSmB) and purified as described earUer (G.L. Newton et al. (1993) J.Bacteriol. 
175, 2734-2742) . GlcN-Ins was prepared by hydrolyzing MSmB quantitatively, with 
purifiedM smegmatis mycothiol S-conjugate amidase, as previously described (G.L. 
Newton et al. (2000) Biochemistry 35, 10739-10746). CySmB-GlcN-Ins was purified by 
preparative high performance Uquid chromatography (HPLC), after acid hydrolysis of 
MSmB, as described earlier (S. Anderberg et aL (1998) supra). 

EXAMPLE 2 

[0194] Assays. A minor modification of the protocol described by Anderberg et al 
(supra) was used for routine measurement of ATP-dependent ligase activity. The enzyme 
activitywasassayedinafinalvolumeof 25 ML containing 12.5 pLofthe ligase in different 
dilutions, 50 pM GlcN-Ins. 100 pM Cys (Calbiochem), 1 mM ATP (Sigma), 1 mM MgCh 
(Fisher), 1 mM DTT (Calbiochem), 25 mMiV-2-hydroxyethylpiperazine-Ar-2- 
ethanesulfonic acid (HEPES) (Sigma), pH 7.5, and 35 pM each of the protease inhibitors 
(Sigma) //-a-p-tosyl-L-phenylalanylchloromethyl ketone and JV-ff-p-tosyl-L- 
lysinechloixjmethyl ketone. The mixture was incubated at 37 "C for 30 min. The assay was 
terminated by addition of 25 pL of 8 mM mBBr in acetonitrile and heating the mixture at 60 
»C for S min to derivatize the thiols. The derivatization was quenched with the addition of 
150 Ml of 10 mM methanesulfonic acid and vortexing. HPLC analysis of CySmB-GlcN-Ms 
and CySmB was carried out by HPLC using a Beckman Ultrasphere IP (250 x 4.6 mm) 
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analytical colunin fitted with a Brownley OD-GU 5 n C-18 cartridge using the following 
linear gradients: 0 mm., 100% A (0.1% TFA m water); 10 min, 100% A; 30 min , 80% A; 
33 min, 100 % B (7.5 % methanol in acetonitrile); 36 min, 100 % B; 38 min, 100 % A; 60 
min, 100 % A (reinjection). The flow rate was 1 mL/min. and the fluorescence detection 
was accompUshed with a 370 mn excitation filter and a 418-700 nm emission filter. The 
Fn«x and K„^ values were calculated by least-squares analysis of Badie-Hofstee plots with 
initial rate data using KaleidaGraph 3.5 (Synergy Software). With [ATP] =1 mM and 
[GlcN-Ins] = 300 pM, the apparent Km for Cys (5-400 mM, n = 8) was determined to be 40 ± 
3 MM and F™ax was 83 ± 3 nmol imn'* mg'. For [ATP] = 1 mM and [Cys] = 100 pM the 
apparent Km for GlcN-Ins (5-600 pM, n = 7) was 72 ± 9 pM and was 90 ± 7 nmol min" 
^ mg-* Protein concentration was measured by the method of Bradford (M.M. Bradford. 
(1976) Anal. Biochem. 72, 248-254) using BSA as standard. 

EXAMPLES 

[0195] Purification ofLigase. All operations were carried out at 4 °C in the presence of 
3 mM 2-mercaptoethanol unless stated otherwise, mc^ 155 cells (182g wet wt.) were 
suspended in 780 mL of 50 mM HEPES pH 7.5 in the presence of 35 pM of the protease 
inhibitors iV^a-p-tosyl-L-phenylalanylchloromethyl ketone and N-a-p-tosyl-Lr- 
lysinechloromethyl ketone. The ceUs were disrupted by ultrasonication (Branson Sonifier 
200) in an ice bath. The cell debris was removed by centrifugation at lOOOOOg for 1 h at 4 
-C. A solution of saturated ammonium sulfate (Fisher) was added to the supernatant to 15% 
and the mixture allowed to stand on ice for 2 h. After centrifiigation at 28000g for 30 min, 
additional SAS was added to the supernatant to 50% saturation and the mixture stored 
overnight at 4 "C. After centrifiigation at 28000^ for 30 mm the protein peUet (32g)was 
resuspended in 500 mL of 50 mM HEPES pH 7.5 and was desalted by passing it through 
Sephadex G-25 column (7.5 x 36 cm). The collected eluent (550 mL) was q>pUed on a 
Toso Haas DEAE 650-M column (5.2 x 26 cm, 500 mL) equiUbrated with 50 mM HEPES, 
pH 7.5. The enzyme was eluted with a linear gradient of 0-0.4 M NaCl in 16 column 
volumes of the buffer at 600 mUh. (Figure 3A) The firactions (#315-350) containing the 
third peak of enzyme activity were combined (670 mL) and were diluted three-fold with 
MiUi-Q water to lower the salt concentration. The diluted solution was appUed to a 
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hydroxyl apatite column (Bio-gel HTP ftom BioRad; 2.6 x 26 cm) at 120 mlVh, which was 
pre-equiUbrated with 10 mM potassium phosphate (Fisher) buffer pH 6.8 containing 100 
mM NaCl. The bound protdns were eluted at 240 mL/h with a linear gradient of 10-100 
mM phosphate (100-0 mM NaCl) in 20 column volumes. The activity in fractions (#75- 
125) was collected (800 mL) (Figure 3B)and diluted to 1800 mL with MiUi-Q water to 
lower the salt concentration. The pH was adjusted to 6.4 with 1 M potassium dihydrogen 
phosphate. The diluted material was appUed to a Reactive Brown 10 dye affinity column 
(Sigma, 1.5 X 11 cm, 20 mL) at 60 mL/h pre-equiUbrated with 50 mM potassium phosphate 
buffer pH 6.5 and washed with buffer until no absorption was evident in the efQuents (200 
mL , 120 mUh). The Ugase was eluted with 50 mM phosphate buffer pH 8.0 at a flow rate 
of 120 rimh. The fractions containing optimal Ugase activity (#12-64) were collected (104 
mL) (Figure 3C). The protem was concentrated by adding soUd ammonium sulfate to 80% 
saturation and allowing precipitation to continue on ice for 2 h. It was collected by 
centrifiigation at 28000^ for 30 min at 4 and resuspended in 50 mM HEPES pH 7.5 
containing 150 mM NaCl and 3 mM 2-mercaptoethanol. This was loaded on a Sephadex 
G-lOO column (Pharmacia, 1.5 x95 cm, 170 mL) and eluted with the same buffer. The 
active fractions were collected (2 mL each) (Figure 3D) and analyzed for purity on 12.5% 
sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGB). Purified protem 
thus obtained was stored at 4 °C for further characterization. 

The results of this multi-step purification are shown in Table 1 below: 



Table 1. Purification of M smegmatis Cvs:GlcN-Ins Ugase (MshC) 



total activity specific activity Yield Purify 



Step 


(mg) 


(smol min'') 


(nmolmin"' mg"') 


(%) 


factor 


crude extract 
15-50% SAS 
DEAB ion exchange 
hydroxyl apatite 
Reactive Brown 10 


9660 
6900 
847 
90 
15 


227 
1030 
465 
300 
150 


0.023 
0.15 
0.55 
3.3 
10 


(100) 
454 
205 
132 
66 


(1) 
6.5 

24 

139 

435 


Affinity 












chromatography 
gel filtration G-lOO 


0.2 


10.8 


54 


4.8 


2350 
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[01961 • The first purification st^ involving a 15-50% ammonium sulfate firactionation 
resulted in a 6.5-fold increase in specific activity wifli very Utile loss of protein. After 
desalting and chromatography on DEAE 650-M, the elution profile (Figure 3 A) revealed 
three major peaks of activity, the major portion of the activity residing in tiie first and tiiird 
peaks. Experience revealed that the third peak was more readily susceptible to further 
purification and the centi^ fractions fi»m this peak were combined for chromatography on 
hydroxyl-apatite (Figure 3B) with activity eluting primarily in one broad peak. The most 
active fractions when combined and bound to a Reactive Brown-10 dye affinity column at 
pH 6.5 and at pH 8 in a sharp but tailing peak. 

[01971 The purification was monitored by SDS-PAGE, which suggested that only a few 
proteins of varying size were present after chromatogr^hy on the Brown-10 column. The 
peak activity from this column was pooled, precipitated with ammonium sulfiite, and taken 
up in HEPES buffer pH 7.5 for chromatography on Sephadex G-100. The activity eluted in 
two peaks corresponding to Mr ~1 50000 and -89000 (Figure 3D), the second peak 
exhibiting highest specific activity. 

[01981 Each fraction of the second peak was concentrated and examined by SDS-PAGE. 
SDS PAGE gels showed purification of MshC. The gel was prepared as follows: Qsn& 1,7 
and 8) Bio-Rad Broad Range molecular mass standards; Gane 2) crude cell fi»e extract 
(lane 3) 15-50% saturated ammonium sulfate extract; (lane 4) pooled fractions #315-350 
from DEAE-M chromatography; (lane 5) pooled fractions #75-125 from hydroxyl apatite 
chromatography; Oane 6) pooled fractions #12-64 fix)m Reactive Brown 10 
chromatography; (lane 9) Sephadex G-lOO fraction 50 (lacking Ugase activity) and (lanes 
10-15) fractions #52-57, respectively, with purified Ugase. Two closely migrating proteins 
of Mr ~ 47000 were detected (in lanes 10-14), witii the larger protein rich in the earUer 
fractions (lanes 10 and 1 1) and the smaller protem dominant in the later fractions (lanes 13 
and 14). 

EXAMPLE 4 

[01991 Protein sequencing. Amino acid sequencing of tiie two active Ugase proteins 
(obtained in Example 3 above) was performed after electioblotting the two bands of active 
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Ugase from 12.5% SDS-PAGB to a polyvinyUdene difluoride membrane. The N-tenninal 
amino acid sequence was detennined on an Applied Biosystems Model 494 Precise gas 
phase protein sequencer at the UCSD Department of Biology Protein Sequencing FaciUty, 

EXAMPLES 
Cloning of the M. tuberculosis MshC ligase. 

[02001 Confirmation that cysS2 of M tuberculosis codes for MshC was obtained by 
using PGR to clone the gene into pRSBTA. M. tuberculosis H37Rv genomic DNA (kindly 
provided by Y. Av-Gay, University of British Columbia) was employed to amplify the 
mshC gene by PCR using the primers: 

r)F-l (5'.GCGGATCCATGCAGTCGTGGTATTGCCC-3') (SEQ ID NO: 10) and 
DR-1 (S-CCAAGCTTCTACAGGTCCACCCCGAGCA-SO (SEQ ID NO: 11) and 
Platinum Pfic DNA polymerase (Gibco BRL). The PCR product was cloned into the BamR 
VHind m (New England Biolabs) sites of pRSETA (Invitrogen) using T4 DNA Ugase 
(Gibco BRL). The expression plasmid was transformed into E. coli TOPlOP (Invitrogen) 
and the presence of the insert verified by cleavage with BamS. I and Hind m. The MshC 
ligase, which contains an N-teraiinal ffise extension, was expressed in E. coli BL21 (DE3) 
plys S (Invitrogen) with induction overnight by 0.5 mM isopropyl-a-D- 
thiogalactopyranoside (Fisher) at 20 »C. Sequencing of the cloned DNA verified that the 
cloning was accurate. The Hise-tagged protein was expressed in E coli after induction with 
isopropyl-a-D-thiogalactopyranoside. Assay of the crude extract with 1 mM ATP, 0.1 mM 
Cys, and 50 mM GlcN-Ins gave 0.12 nmol min'' mg ' of Ugase activity whereas analogous 
me^urements on E coli transformed with the blank vector yielded no measurable Ugase 
activity (<0.01 nmol min ^ mg'^). 

EXAMPLE 6 
Mutagenesis of M. smegmatis mc^lSS 

[0201] Strains and chemicals. Escherichia coli DH5a (FrecAl hsdSAl thiA gyrA96 
supEU endM relAl recAl deoR Hlac7YA-argF)\J169 (<|>80 lacZ AM15) was grown in 
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Luria-Bertani (LB) medium at 37"C. Streptomycin (50 jig/ml), kanamycin (100 ng/ml) 
ampicillin (100 ng/ml) and hygromycin (150 fig/ml) were used when required. M 
smegmatis mcHSS was kindly provided by Dr. W.R. Jacobs and was grown in 7H10 
medium with 10% OADC and 0.05% TWEHST™ -20 (PBT). Kanamycin (25 |ig^al) and 
hygromycin (50 ng/ml) were used when required. Chemicals were purchased from Sigma 
Chemicals (St. Louis, Mo.). 

I. Preparation of chemical mutants lacking MshC gene activity 

[0202] Chemical mutagenesis of M. smegmatis mcHsS with JV-Methyl-AT-nitro-iV- 
nitrosoguanidine (MNNG; Aldrich) to produce mutant 164 was performed as described by 
Newton et al. (1999) (JBiochem. Biophys. Res. Commun. 255:239-244) except that diamide 
was not added to the plating medium. M. smegmatis strains. mcHSS and 164 were grown at 
3TC while the transposon mutants were grown at 42*'C. 

[02031 A grided Ubrary of 1600 M. smegmatis mcMsS clones which were subjected to 
an initial screen for MSH content using a blotting immunoassay protocol (M.D. Unson et al. 
(1998) J. hnmunol. Methods, 214:29-39). A total of 56 colonies were selected for further 
examination. These were grown in Uquid culture and the cells extracted for HPLC analysis 
of MSH content. Although several mutants appeared to have reduced MSH content, one 
mutant, strain 164. was found to have significantly lower MSH content (see Table 2) and 
was selected for fiuiher study. 

n. Preparation of Tn611 transposon mutants lacking MshC activity 

[02041 The transposon mutagenesis of M smegmatis mcHSS cells was performed as 
described by Guilhot et al. , 1994 (J. Bacteriol. 176:535-539) using the thermosensitive 
plasmid pCG79 containing Tn61 1 . The plasmid was electroporated into M. smegmatis and 
transformants were selected on PBT media supplemented with kanamycin at 30°C. 
Randomly chosen clones were grown at 30''C for 72 h in 5 ml of 7H9 medium 
supplemented with kanamycin. These cultures were used to inoculate antibiotic-free 7H9 
media and the inoculated cultures were grown for 24 hours at 39''C. Various dilutions were 
then plated on PBT medium siq>plemented with kanamycin and incubated at 39''C. 
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m. Molecular biology manipalations 

[02051 Genomic DNA was isolated from M. smegmatis culture according to Jacobs et 
al, (supra). Standard recombinant DNA techniques and Southern blots were carried out as 
described by Sambrook et al, 1989 {Molecular cloning: a laboratory manual 2"'* ed. Cold 
Spring Harbor Laboratory Press. Cold Spring Harbour, N.Y.). M. smegmatis 
electroporation was performed as described by Snapper et al, 1988 {Proa. Natl Acad. Sci 
USA. 85:6987-6991). 

[0206] Using the above mutation procedures, a transposon mutant library in Af. 
smegmatis consisting of 10,000 mutants was created using the thermo sensitive plasmid 
pCG79 (Perez et al. (1998) Methods Mol Biol 101 :187-198). One thousand representative 
mutants were picked, sorted, and cultured individually in ELISA plates. The mutants were 
screened by ELISA utilizing a polyclonal antibody against MSH (M.D, Unson et al. (1999) 
J. Clin. Microbiol. 214:29-39). Three independent MSH deficient mutants were identified as 
shown in Table 2 below: 

Table 2 

mc^l55 164 Tnl Tn2 Tn3 

GroAvthon - + + + + 

5 0ug/ml INK . 

A405nm'' 1394 0.396 0.256 0.278 0.286 

A600mn 0.208 0.405 0.279 0.320 0.288 

Signal/Growth** 4.629 0.978 0.918 0.869 0.993 
Growthat42'*C ND ND + + " 

onlNH 

Absorbance of the alkahne phosphatase product in the MSH- capture ELIZA assay. 



proportional to the MSH content of the cells. 

[0207] Only two of these mutants (Tnl and Tn2) were found to stably incorporate the 
Tn61 1 transposon into the genome as judged by Southern analysis. If the insertion is in the 
genome, three bands are expected when using IS6100 as a probe for hybridization to Pstl 
digested genomic DNA of the mutants; two identical to those in the parent plasmid (0.9kb 
and 2kb) and a third one of a different size that in the present case turned out to be of 4.5kb 
in size for Tnl and Tn2. A faint band of 4.5kb was also observed in initial Southern 
hybridization examining Tn3 genomic DNA. However, subsequent experiments revealed 
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that the transposon insertion in Tn3 is not stable as this mutant failed to grow at 42**C on 
selective plates (Table 2). Thus, phenolic analysis of the Tj)3 mutant was aborted. 
Southern hybridization with IS61 10-labelled probe also confirmed that only one transposon 
was presait in the chromosome for each of the two transposon mutants Tnl and Tn2. 

EXAMPLE? 

Assay of MSH and MSH precursors in MshC deficient mutants. 

[0208] The invention mutants were also analyzed for MSH and amine-intamediate 
levels during exponential growth using sensitive HPLC assays. Labeling of cell extracts 
with monobromobimane (mBBr) to determine thiol content was performed wilh 
modifications to previously pubUshed protocols (Anderberg et al., supra and GX. Newton et 
al., 1996, supra). Cell pellets fix)m 3 ml culture were resuspended in 0.5 ml of warm 50% 
acetonitrile-water containing 2 mM mBBr (Calbiochem, San Diego, Ca), and 20 mM 
HK*ES,.pH 8. The suspensions were incubated for 15 min at 60°C water bath and then 
cooled on ice. A 2-5 \il aUquot of 5M HCl or 5M trifluoracetic acid (TFA) was added to 
produce a final acidic pH. 

[0209] Control samples were extracted with 0.5 ml of warm 50% acetonitrile-water 
containing 5mM NEM and 20mM HEPES, pH 8. The suspensions were incubated for 15 
min at 60°C and then cooled on ice. After addition of mBBr to 2mM the solution was 
incubated for a second time for 15 min at 60''C. The control samples were cooled but not 
acidified. Cell debris was pelleted in each sample by centrifugation (5 min, 14,000 x g) in a 
microcentriftige. 

[0210] HPLC analysis of thiols was carried out by injecting 25 ^1 of a 1 :4 dilution of the 
samples'in 10 mM HCl onto a Beckman ULTRASPHERE™ ODS 5^ (250mm x 4.6mm) 
column. Elution was accomplished with 0.25% glacial acetic acid pH 3.6 (Buffer A) and 
95% methanol (Buffer B) using the following gradient: Omin; 10% B, 15min; 18% B, 
BOmin; 27% B, 32min; 100% B, 34miI^ 10% B, and 60min; 10% B, (reinjection). The flow 
rate was 1 ml per min. and the fluorescence detection was as previously described. 
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Tables 


M smegmatis 
Strain 


GlcN-Ins 
(^oles/gm dry weight) 


Ligase specific 
activity 

(mnole/min/mg 
protein) 


Mycothiol 
(^moles/gm dry 
weight) 


mc^lSS 


0.1 


0.320 ± 0.010 


10.0 


164 


2.5 


0.004 ±0.001 


0.1 


Tnl 


2.6 


0.004 ±0.002 


<0.004 


Tn2 


1.9 


0.004 ±0.002 


<0.003 



EXAMPLES 

Assay of ligase activity assay in MshC deficient mutants. 

[0211] The Ugase specific activity was estimated by the ATP-dependent formation of 
Cys-GlcN-Ins catalyzed by M. smegmatis cell-free extracts pre?)ared and analyzed in 
tripUcate as described in Example 2 above (Anderberg et al., si^ra) except that 
centrifiigation of the extracts was conducted in a microcentrifixge for 5 min at 14,000 x g. 
As shown by the results of these studies s\unmaiized in Table 3 of Example 7, all three 
mutants and strain 164 were found to be deficient in MSH and overproduced GlcN-In&by 
about 25-fold. 

[0212] In all cases the content of Cys-GlcN-Ins, the Ugase product, was undetectable 
(<0.002 mmol/gm dry weight). A block in Cys:GlcN-Ins Ugase activity would be expected 
to result.in an accumulation of GlcN-Ins. These findings suggest that the Ugase gene (mshQ 
has been interrupted in the transposon mutants. In addition, the results clearly show that 
Tnl and Tii2 are similar to the chemical mutant 164, all having a very low level of MshC 
activity in cmde extracts (Table 3 of Example 7). 

EXAMPLE 9 

Toxici^ studies and antibiotic sensitivity tests in mutants. 

[0213] It has previously been shown that MSH takes part in detoxification of alkylating 
agents. Consequently, mutants lacking MSH should then be more susceptible to these 
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agents. The fluorescent alkylating agent, mBBr, selectively reacts with cellular thiols and 
mycobacteria possess a mycothiol-dependent amidase fliat can detoxify mycothiol- 
monobimane conjugates (G.L. Newton et al (2000a) siq)ra.) Lack of mycothiol would 
presumably result in increased sensitivity to this toxin. 

[0214] To test this hypothesis, Tnl, Tn2, and 164 were grown on PBT plates 
supplemented with increasing amounts of the alkylating agents mBBr, iodoacetamide, and 
l-chloro-2,4-dinitrobenzenei (CDNB). The alkylating agents were poured into the molten 
agar. Table 4 below shows the minimum inhibitory concentrations of various alkylating 
agents against mc^l55 and MSH* mutants. 

Table 4 





Minimum Inhibitory Concentration Oig/ml) 


Agent 


Wild 


164 


Tnl 


Tn2 




Type 








CDISB 


0.05- 


0.075 


0.025 


0.025 




0.075 








mBBr 


0.1 


0.075 


0.025 


0.025 


Iodoacetamide 


0.05 


0.025 


0.025 


0.010 


Diamide 


10< 


5 


<1 


<1 



(02151 The transposon mutants Tnl and Tn2 were 4-fold more sensitive to mBBr as 
compared to wild type mch 55. The chemical mutant, 164, which has 1% of Hie wild type 
MSH content, was also more susceptible to mBBr, but not to the same extent as the 
transposon mutants, which have no detectable MSH. Iodoacetamide is another alkylating 
agent that is commonly used to derivatize proteins. Mutants 164 and Tnl were two-fold 
more sensitive to iodoacetamide, and mutant Tn2 5-fold more sensitive, as compared to the 
wild type mc^l55 strain. Tnl and Tn2 were also more sensitive to CDNB (2-3 fold), another 
alkylating agent and a glutathione S-transferase substrate, as compared to wild type mc^l55. 
In the case of the thiol-specific oxidant, diamide, which readily penetrates cells and oxidizes 
thiols to disulfides. CN.S. Kosower et al., Meth. Enzymol. 251:123-133), a greater than 10- 
fold increase in sensitivity of the transposon mutants was observed. Sensitivity of 164 to 
diamide was doubled compare to the wild type. In summary, the transposon mutants lacking 
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mycothiol wwe more susceptible to alkylating agents than wild type M. smegmatis mc^l55 
although the degree of susceptibility depended on the toxin tested. 

[0216] . Peroxide sensitivity was determined by incubating cells (2.5x10^ per ml) at 37** C 
in Middlebrook 7H9 medium with 0.4% glucose and 0.05% TWEEN® 80 containing 
hydrogen peroxide at the desired level for 2 hours with shakmg. Cells were diluted in fresh 
medium and plated on 7H10 agar. Colonies were counted when they achieved a diameter of 
1-2 mm, which required 4-5 days for the wild type strain and up to 10 days for MSH- 
deficient mutant strains. 

[0217] To determine sensitivity to redox cycling agents, disk assays were performed. 
Briefly, cells were grown to mid-log phase and a lawn of cells was plated onto PBT or LB 
plates supplemented with appropriate antibiotics. Various amounts of redox-cycling agents 
plumbagin, and antibiotics menadione and nitwfurantoin were added to the disk in 10 
volume and allowed to dry. The disks were placed onto tiie lawn of cells and incubated 2-3 
days. 

[02181 The results of Figure 6 show that Tnl and Tn2 are about 10 times more sensitive 
to hydrogen peroxide tiian the wild typ6 strain. Tnl and Tn2 were unable to tolerate as Utile 
as ImM H2O2 while the wild type strain can survive at 10 mM H2O2. Tolerance of Stirain 
164 is intermediate between wild type mch55 and tiie transposon mutants in its 
susceptibility to peroxide stress (Figure 6). 

[0219] When the oxidative stress is in tiie form of redox-cycling agents that increase tiie 
superoxide concentration in tiie cell, a similar pattem is seen. As seen in tiie results 
summarized in Table 5 below, Tnl and Tn2 mutants are markedly more sensitive to 
plumbagin and the antibiotic nitirafurantoin than the wild type straiiL 
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Tables 



Agent concentration Zone of Inhibition (mm) 



[jrniol] Wild Type 164 Tnl Tn2 

0 01 Plumbagin 13.5±0.5 13.3±1.9 23.0±1.2 23.afcl.0 

0.005 Plumbagin 11.0±1.0 9.3±0.9 17.at0.6 15.0±0.0 

0.5 Menadione 19.7±0.9 26.7±0.9 31.3±1.3 16.4±1.9 

0.1 Menadione 9.7±0.3 8.7±0.7 20.7±3.8 15.3±2.0 

10 Nitrofurantoin 6.0±0.6 8.7±0.3 14.0±2.7 35.0±3.6 

0.5 Nitrofurantoin 4.7±0.3 5.0±0.6 10.7±2.6 ND 

[02201 The results for the transposon mutants with menadione showed no consistent 
pattern and are therefore inconclusive. Mutant 164, which contains a low level of MSH. is 
less sensitive to the above agents, undergoing a significant increase in the zone of inhibition 
only in the case of 0.5 junol menadione. 

Results of antibiotic sensitivity tests 

[0221] Since the invention mutants are devoid of MSH they cannot carry out MSH- 
dependent detoxification via the pathway involvmg mycothiol 5-conjugate amidase activity 
(G. L. Newton et al 2000a, supra). To test the hypothesis that MSH is involved in 
protection against antibiotics, the antibiotic sensitivity of the transposon and chemical 
mutants to a variety of antibiotics, including erythromycin, azithromycin, vancomycin, and 
penicilUn G. was tested by application of 0. 2jig. 7^g, 8^g. 32fig, 125^g or 250ng of 
antibiotic to the disks. The results of these tests summarized in Table 6 below show that the 
transposon mutants Tnl and Tn2 and the chemical mutant 164 are 3-16 fold more sensitive 
than the parent strain (mcH55) to the antibiotics erythromycin, azitinx>mycin. vancomycm, 
and penicillin G. 
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Table 6 



Drag Minimu m Inhibitory Concentration (jig/ml) 





mc^l55 


164 


Tnl 


Tn2 


Isoniazid 


2 


32 


>250 


>250 


Eryfliromycin 


125 


32 


7.8 


32 


Azithromycin 


7.8 


2 


2 


2 


Vancomycin 


6 


1 


2 


1.5 


Penicillin G 


>250 


>250 


32 


32 


Rifampin 


32 


32 


7.8 


7.8 


Rifamycin 


32 


32 


7.8 


2 



(02221 Examination of drugs used to treat M tuberculosis infection, rifamycin and 
rifampin, revealed that all the transposon mutants are more sensitive to these drugs than th« 
chemical mutant l64 (Table 6), but no significant difference was observed for ethambutol c 
pyrazinamide. The opposite effect was observed for isoniazid, to which the mutants are 
greater than 100-fold more resistant than the wild type while 164 exhibits only 16-fold 
greater resistance (Table 6). 

EXAMPLE 10 
roENTmCATION OF MshD 
Bacterial strains and growth conditions 

[0223] M. smegmatis strain mc^l55 was obtained from W, R. Jacobs (Department of 
Microbiology and Immunology, Albert Einstein College of Mediciae, Bronx, New York 
USA) and cultured at 37 in Middlebrook 7H9 media supplemented with 0.05% Tween 
80 and 1% glucose imless otherwise noted. 

Construction of mutant library 

[0224] The gram negative bacterial Tni transposition system was selected to generate 
this M smegmatis mutant Ubrary due to its random target sequence specificity. An M 
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smegmatis (Tn5) EZ::TN <lCAN-2> (Epicentre) transposon library was prepared according 
to the manufecturer's instructions (Derbyshire et al. (2000) Epicentre Forum 2:1-4; 
Goryshin et al. (2000) Nature Biotechnol 18:97-100). Electrocompetent M. smegmatis 
strain mc^l55 cells were prepared as described in Example 6. The library contained mutants 
fiom 20 separate l-jil transformations plated on 20 150-mm dishes each containing 125 ml 
Trypticjise Soy Broth medium (BBL) with 10 ^g kanamycin/ml and 30 jig isoniazid/ml. 
Reactions without DNA and with DNA were plated without isoniazid as controls. 
Transformants resistant to both kanamycin and isoniazid w«:e gridded and replated on 7H9 
Middlebrook agar supplemented with 0.4% glucose, 0.05% Tween 80, 20 ng kanamycin/ml 
and 100 \ig isoniazid/ml. Transformants that survived the increased levels of antibiotics 
were subcultured into 5 ml 7H9 Middlebrook media containing 0.05% Tween 80, 1% 
glucose, 20 Jig kanamycin/ml at 37'»C with shaking. Cells fix)m a portion (2 ml) of the 
stationary phase culture were collected by centrifugation and analyzed for thiols as 
described below. Transformants containing less than 1% of the wild-type mycothiol content 
were retained for genetic analysis. 

Determination of thiols and GlcN-Ins 

[02251 Cells were extracted and labeled with mBBr for thiol analysis as described 
previously with the exception that HEPBS was substituted for Tris. The thiol-bimane 
derivatives (RSmB) were analyzed using a minor modification of HPLC method 1 CFahey 
and Newton (1987) Methods Enzvmol 143:85-96) to provide improved separation of MSmB 
ftom the major reagent hydrolysis peak. A 4.6 x 25 cm Beckman ULTRASPHERE® ODS 
ff (ion pairing) column (No. 235335) was used with buffers A (0.25% acetic acid in H2O 
titrated to pH 3.6 with NaOH) and buffer B (HPLC grade methanol. Fisher). The linear 
gradient was as follows: 0 min, 10% B; 5 min, 10% B; 15 min, 18% B; 30 min, 27% B; 32 
min, 100% B; 34 min, 10% B; 45 min, reinject. The flow rate was 1.2 ml/min at ambient 
temperature and retention times were as foUows: CySmB-GlcN-Ins, 9.4 min; CySmB, 10.8 
min; U18, 18 mim mBOH (reagent hydrolysis), 19.3 min; MSmB, 21 min; AcCySmB. 26 
min. 
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Genetic analysis of MSH-deficient mutant mshD::Tn5 

[0226] Genomic DNA was purified from a 250 ml culture of mshD: :Tn5 by standard 
methods (Larsen, "Genomic DNA preparation" In: Hatfull GF , Jacobs J, W. R. (eds) 
Molecular Genetics of Mycobacteria, ASM Press, Washington D. C, pp 313-320, 2000) . 
Initial attempts to sequence outward ftom the insert with intact genomic DNA according to 
the manufacturer's instructions gave inadequate sequence quality. Therefore the transposon 
and adjacent genomic DNA (Figure 9) was subcloned into pUC18 for sequencing. 
Genomic DNA was digested with SaR arPstI (Ihvitrogen; Figure 9) and cloned into 
similarly digested pUC18 plasmid DNA (Sambrook et al. 1989. supra). E. coli DH5a 
(Invitrogen) was transformed with the resulting plasmids and the kanamycin resistant 
transformants were analyzed for the presence of inserts larger than the EZ::TN <XAN-2> 
transposon (1.2 kb), indicating the presence of M smegmatis genomic DNA. The purified 
plasmid DNA with insert and the EZ::TN <KAN-2> reverse primer RP-1 (Epicentre) were 
submitted to the UCSD Cancer Center Sequencing Center for sequence determination. The 
sequences were analyzed using pubUshed and unpubUshed genome sequence data at the 
Sanger Centre website (on the worldwide web at sanger.ac.uk). Preliminary sequence data 
for M. smegmatis mc^SS was obtained bom The Institute for Genomic Research website 
on the worldwide web at tigr.org. 

[0227] Attempts to identify the disrupted mycothiol biosynthesis gene by sequencing 
outward from the transposon using intact genomic DNA gave a sequence with many 
undetermined bases and the approach was unsuccessfid at identifying the MshD gene. The 
EZ::TN <KAN-2> transposon with adjacent genomic DNA (Figure 9) was subcloned into 
pUC18 for sequencing. The Pstl clone gave 78 bp of sequence complementary to bases 
1364224-1364301 of the unfinished M. smegmatis genome (gnl|TIGR_1772|contig:3312) at 
the TIGR website. The Sal /plasmid gave 504 bp of sequence complementary to bases 
1363797-1364301, a longer sequence of the same gene at the same insertion point (Figure 

9). 
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Cloning of the M. tuberculosis mshD gene and expression in E. coli 

[0228] The mshD gene (Rv0819) was amplified from M. tuberculosis H37Rv 
chromosomal DNA using Platinum Pfe DNA Polymerase (Invitrogen) with oUgonucleotide 
primers 

5'-GGCTTGCAGGTGACGGCGCTTGACTGGCGCT-3' (SEQ ID NO: 23) and 

5'-GGAAGCTTGTTATCCGTGCCAGCCAGCGCG-3' (SEQ ID NO: 24), 

which generated a polymerase chain reaction (PGR) product containing a unique Pstl cut 
site at the S'-temiinus and a Hind, m site at the 3'-teraiinus. The PGR product was purified 
fiom a 1.2% agarose gel using a Qiagen gel extraction kit. The PGR fiagment was then 
digested witii Pstl and HindSR according to the manufiicturer's instructions and purified 
fiom a 1 .2% agarose gel as above. The expression vector, pRSET C (Invitrogen), was 
similarly digested and purified. The digested mshD gene and vector were Ugated with T4 
DNA Ligase (Invitrogen) to generate plasmid pRS0819. The ligation mixture was used to 
transform competent E. coli DHSa cells, which were plated on LB agar (Sambrook et al. 
1989, supra) containing 100 |ig ampicillin/ml. 

[0229] The transformants were cultured in 5 ml of LB broth containing 1 00 jig 
ampicillin/ml overnight at 37°C. The plasmids were purified with a Qiagen mini prep kit, 
digested with Pst I and Hind m as above, and analyzed by electrophoresis to verify the 
presence of the mshD gene (-950 bp) insert. The expression plasmid pRS0819 and the 
empty pRSET C vector were used to transform E. coli BL21 (DE3) pLysS (Invitrogen). 
Transformants were selected on LB agar containing 100 \ig ampiciUin/ml and 35 ng 
chloramphenicol/ml to maintain plasmid pLysS. 

Preparation of Cys-GIcN-Ins 

[0230] • The bimane derivative of Cys-GlcN-Ins (CysSmB-GlcN-Ins) fix)m the acid 
hydrolysis of MSmB prepared as described above was used as a standard for HPLC. Cys- 
GlcN-Ins in the thiol form was purified tmm 5 Uters (23 g wet weight) of late exponential 
phase mshDrJYnS cells essentially as described previously for MSH (Unson et al. 1998, 
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supra). However, DTT was added to the extract prior to thiol affinity chromatography and 
after the final preparative HPLC step in order to maintain the readily oxidizable Cys-GlcN- 
Ins in the reduced form. The final product (unretained on preparative HPLC and coeluted 
with Tris on HPLC) contained 1 yanaol of Cys-GlcN-Ins in 1 ml of 1 mM DTT, 1 M Tris- 
TFA, pH 7.0, corresponding to a 23% overall recovery. 

Assay of mycothiol synthase (MshD) in extracts 

[0231] ■ Exponential phase cultures (ODfioo-l) of M smegmatis mcHSS and mshD::Tn5 in 
7H9 Middlebrook media containing 1% glucose and 20 ^g kanamycin/ml (as necessary) 
were harvested by centrifiigation. The E. coli MshD expression strain pRS0819 and the 
empty pRSET C vector strain were cultured in LB medium containing 100 |Ag ampicillin/ml 
and 35 ng chloramphenicoVml to ODfioo- 0.6. The cells were induced with 0.75 mM 
isopropyl-P-D-thiogalactopyranoside for 16 h at 22»C. Cells were harvested and 
resuspended 1 :5 (w/v) in extraction buffer (50 mM HEPES pH 7.5, 50 mM NaCl, 1 mM 2- 
mercaptoethanol, and 35 jiM of the protease inhibitors, JV-a-p-tosyl-L- 
phenylalanylchloromethyl ketone and AT-a-p-tosyl-L-lysinechloromethyl ketone). The cells 
were disrupted by sonication on ice and the ceU debris was removed by centrifiigation at 
10,000g'(4°C) for 20 min. 

[0232] Mycothiol synthase assays were carried out in a 100 jil total volume containing 
10 ^1 M. smegmatis extract or 5 ^il of 100-fold diluted E. coli extract, 50 mM HEPES pH 
7.5, 2 mM DTT. 100 /M CoASAc, and 50 jjM Cys-GlcN-Ins. The reaction was sampled 
prior to addition of extract (zero time) and after 10, 20, and 30 min incubation at 37'>C for 
M smegmatis and after 3, 6, and 12 min for E. coli. Samples (20 nl) were quenched and 
derivatized by mixing with 20 ^l of 8 mM mBBr in acetonitrile at 60°C. The derivatized 
sample was incubated at 60"C for an additional 10 min. cooled on ice, acidified with 5M 
methanesulfonic acid to a final concentration of 10 mM, and clarified by centrifiigation. . 
The sample was diluted 4-fold in aqueous 10 mM methanesulfonic acid and analyzed by 
HPLC for the bimane derivatives of Cys-GlcN-Ins and MSH as described above. Controls 
included samples without extract and without CoASAc. Protein content of extracts was 
determined by the method of Bradford (Bradford 1976, siqira). 
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Results 

[0233] A M. smegmatis mutant, strain 49, is severely deficient in MSH and is resistant to 
100 ng isoniazid/ml, a concentration 100-fold above the minimal inhibitory concentration 
for the wild-type strain mcMSS (Newton et al 1999). Therefore tests were conducted to 
determine whether isoniazid could be used as a preliminary screen for isolation of MSH 
deficient mutants in a library generated fix>m M. smegmatis by the Tn5 transposition process 
which involves a cut-and paste mechanism (Goryshin et al 2000, supra). Transformants 
generated with the commercially available EZ::TN <KAN-2> transposon kit were initially 
screened on 10 fig kanamycin/ml and 30 ng isoniazid/ml. These transformants were then 
gridded and replated on 20 ^g kanamycin/ml, to select for the <KAN-2> transposon, and 
ICQ (ig isoniazid/ml, to select for mycothiol deficiency, generating a Ubrary of -2,000 
highly isoniazid-resistant mutants. These individual mutants were screened for thiols by 
HPLC and 3 mutants in the first 200 screened had a mycothiol content of <1% of the wild- 
type level. One of these mutants, designated mshDvJlnS, contained substantial amounts of 
Cys-GlcN-Ins as would be expected from a mycothiol synthase (MshD) mutant (Figure 1). 
The other two mutants were found to have levels of mycothiol precursors consistent with 
mutations in earlier steps of mycothiol biosynthesis. 

[02341 • The levels ofmycothiol and its precursors were analyzed and the MSH level in 
»w/i£)::Tn5 was found to be <5% of the mc^SS wild-type level as shown in Table 7 below: 
Table 7: CeUular levels of mycothiol precursors, mycothiol, and mycothiol synthase 
activity in M. smegmatis wild-type (mc'lSS) and mycothiol mutant (mshD:iTnS) 
StoinI GlcN-Ins Cys-GlcN-Ins MSH Synthase 

growthphase . , • .i 

(yxmoUg residual dry weight) [nmole mm 



(mg protein)''] 



mc^l55-exp 
mcnss-stat 



or 0008= 10=^ 5.8±0.8 

0.01" ND 10" ND 

7n5/iD::Tn5-exp 0.35 ±0.05 2.3 ±0.10 0.12 ±0.008 0±0.2 

mshD:.Tn5-s\zt 0.27 ±0.01 0.84 ± 0.025 0.065 ±0.010 ND 

Data from (Anderberg et al 1998). ND. not detenmned; exp, exponential growthphase ceUs; stat. stationary 
growth phase cells. 
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[0235] As shown in Table 7, whereas the level of Cys-GlcN-his in mc'lSS is very low 
and difficult to measure, Cys-GlcN-Ins proved to be a major thiol produced m mshD. iTnS, 
found at 290-fold higher level than in the wild-type strain during e:q)onential phase groAvth. 
The level of GlcN-Ins was increased 3- to 30-fold over that found in the wild-type strain. 
Thus, the two immediate precursors to mycothiol in the biosynthetic pathway accumulate in 
this mutant. An unidentified thiol was observed at 18 mm in the HPLC analysis and was 
estimated to occur at 1-5 pmol per g residual dry weight m diflEerent cultures. 

[0236] Assays of the mycothiol synthase activity in dialyzed cell extracts showed a 
complete loss of acetyl-CoA/Cys-GlcN-Ins acetyltransferase (MshD) activity in mutant 
mshD::Tu5 relative to the wild-type stain (Table 7), establishing that mshD::Tn5 is 
defective in MshD activity. 

[0237] The M. smegmatis genome has been completely sequenced but tiie annotation is 
not yet available, so the annotated M. tuberculosis H37Rv genome (Cole et al 1998, supra) 
on Tuberculist (witii url address: genoUst.pasteur.fr/TubercuList) was used to describe tiie 
transposon insertion and sequence analysis. A tblastx search of tiie initial portion of flie 
experimental sequence produced an optimal match witii ORF Rv0819 while tiie latter 
portion matched Rv081 8. This indicated tiiat tiie transposon had inserted into tiie M 
smegmatis gene corresponding to Rv0819, annotated as ahypotiietical protein of unknown 
function. A tblastn search of tiie M. smegmatis unfinished genome at tiie TIGR website 
using tiie M. tuberculosis protein sequence for Rv0819 produced a fit witii 60% identity for 
residues 1-253 in tiie +1 reading frame and a 50% identity for a fit of residues 245-309 in 
tiie +2 reading frame. Deleting cytosine-733 in tiie M smegmatis sequence produced a 
frame shift allowing an optimal alignment of tiie M tuberculosis and M. smegmatis 
sequences witii 62% overall identity as shown in Figure 8. Using tiie Sanger Centre 
databases matches were also found for tiie completed genome sequence of Af. l^rae (Cole 
et al 2001) witii 75% identity in a 3 15 residue overlap, for tiie finished genome sequence 
of Streptomyces coelicolor A3(2) witii 46% identity in a 256 residue overlap, and for tiie m- 
progress genome sequence Corynebacterium diphtheriae witii 34% identity in a 304 residue 
overlap. All of tiiese organisms belong to famiUes shown to produce MSH (Newton et al. 
1996). 
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[0238] To verify the assignment of Af. tuberculosis gene Rv0819 as an ortholog of 
MshD &om M smegmatis, it was cloned into pRSET C and expressed in E. coli. When the 
protein was expressed at 22° C the cloned MshD protran band was the dominant protein in 
the soluble crude cell extract with a subunit molecular mass of -38 kDa (results not shown). 
The MshD was active in the crude cell extract as the His6 tagged protein with a specific 
activity of 500 ± 50 nmole min ' (mg protein)"'. Parallel measurements on crude extracts of 
the E. coli empty vector control gave no measurable rate, 1 ± 1 nmole min'' (mg protein)'*, 
under identical conditions. Thus, the gene product of Rv0819 fiomM tuberculosis is 
confirmed as having MshD activity. 

EXAMPLE 11 
IDENTIFICATION OF MshA 

Strains and plasmids. 

[0239] Escherichia coli was grown in Luria-Bertani (LB) agar or broth at 37 "C. 
Ampicillin (Sigma; 100 ng/ml), kanamycin (Sigma; 50 |ig/ml) and hygroniycin 
(Calbiochem; 150 ^g/ml) were used when required. Af. smegmatis strains were grown at 
3TC in Middlebrook 7H9 broth (Difco) supplemented with 0.05% TWEEN-80® detergent. 
1% glucose and 10% OADC (BEL) or in 7H10 agar (Difco) supplemented with 0.5% 
glycerol, 1 % glucose and 10% OADC. For mycobacterial growth, hygromycin (75 ng/ml) 
and kanamycin (25 jig/ml) were used when required. 

Isolation of MshA::Tn5 

[0240] A Tn5 transposon horary was constructed using the BZ::TN <KAN-2> kit firom 
Epicentre and yielded three MSH-deficient mutants, which were strongly resistant to 
isoniazid. These were subcultured with shaking in 7H9 Middlebrook medium containing 
0.05% TWEEN-80® detergent, 1 % glucose and 20 ng/ml kanamycin at 37 »C, and samples 
were collected in late exponential or early stationary phase growth for determination of 
mycothiol components as described below. One mutant was found to have no measurable 
levels of MSH, GlcNAc-Ms, or GlcN-Ins and was designated as strain MshA::Tn5. 
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Genetic analysis ofMshA::Tn5 

[0241 ] A 250 ml culture of Afe/L4:;Tii5 was prepared for isolation of genomic DNA by 
standard methods (M.H. Larsen, "Some common mettiods in mycobacterial genetics. In 
G.F. Hatful and J. Jabobs, (Ed.), Molecular Genetics of Mycobacteria, ASM Press, 
Washington D.C., pp 313-320.8) but attenq)ts to sequence outward from the insert with this 
DNA gave poor sequence quality. The transposon with adjacent DNA (Figure 13) was 
therefore subcloned into pUC18 for sequencing. Briefly, genomic DNA was digested with 
either SaU or PstI (Invitrogen) and cloned into similarly digested pUCl 8 plasmid DNA. The 
resulting plasmids were used to transform competent E. coli DH5a (Invitrogen), and the 
kanamycin resistant transformants were analyzed for the presence of inserts larger than the 
EZ::TN plasmid DNA <KAN-2> transposon (1.2 kb), indicating the presence of M 
smegmatis genomic DNA. The plasmids containing transposons were isolated using a 
QIAprep® Spin Miniprep Kit (Qiagen) and was sequenced by the UCSD Cancer Center 
Sequencing Center using the EZ::TN <KAN-2> reverse primer RP-1 suppUed by Epicentre. 

Cloning of ORF Rv0486 &omM tuberculosis and complementation of mshA mutants. 

[0242] • M tuberculosis H37Rv genomic DNA was prepared as described previously. 
The open reading frame (ORF) Rv0486 was amplified by PCR using this DNA and primers 

486 PAL 3 ( 5' CATATGCACGGTCGGCAAGGAGG-3') (SEQ ID NO: 25) and 

486 PAL 5 ( 5' AGGATCCATGGCAGGTGTGCGGCAC 3') (SEQ ID NO: 26). 

These primers were designed to contain Ndel and BaniHl restriction sites, respectively. 
PCR was performed with Taq polymerase (Gibco BRL) using 1 mM MgCl2 and 5% 
dimethyl sulfoxide (DMSO). Thirty cycles of PCR included denaturation at 94 'C for 40 
sec, annealing at 60 'C for 1 min, and amplification at 72 "C. The PCR products were 
separated on a 0.8% Agarose gel. The appropriate PCR product was Ugated into the vector 
pCR2.1 of the TA cloning kit (Invitrogen), and transformed into competent E. coli cells by 
standard heat shock transformation procedure. Plasmid DNA was isolated and restriction 
digestion with Ndel and 5amHI(Fermentas) was used for verification and subcloning. A 
fragment corresponding in size to ORF Rv0486 was isolated from agarose gels, purified 
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using a QIAquick® Gel Extraction Kit (Qiagen), and ligated to pALACE to obtain the 
plasmid pAL0486. This pAL0486 plasmid was electroporated into the MSH deficient 
mutants Al and 49 as previously described and selection was performed on hygromycin 
plates (Newton et al. (1999), siq;)ra). 

Mapping the chemical mutation in mutant 49 

[0243] The M. smegmatis sequence obtained from The Institute for Genomic Research 
was used to design primers upstream of the start codon: 

(49seq 5' (5'GCAACGAGAAGGCCGTCGAACT 3') (SEQ ID NO: 27) 

and downstream of the 3' region 

(49seq 3' (5' GTCCTCGATGATCTTCCTGACA 3') (SEQ ID NO: 28) of theM 
smegmatis mshA gene. The primers were used to ampUfy M smegmatis mc^l 55 and DNA 
from two different colonies of mutant 49 and the ampUfied band was cloned into pCR2.1 
(Invitrogen). After ensuring by restriction digestion that the M. smegmatis mshA gene had 
been cloned, the DNA was sequenced using primers 49seq 5' and 49seq 3* as weU as the 
universal primers T7 and M13R. To sequence the internal region of tiie homolog and 
confirm the missense mutation, the same procedure was followed with primers 

49MED 5* (5' GCGTGGCGGTGTTGTCGGTA 3') (SEQ ID NO: 29) and 

49MED 3' (5' GACCAGTTGTTCGCGGCTCT 3') (SEQ ID NO: 30). 

[02441 Comparison of the sequences obtained in this manner revealed a single base pair 
change in the mutant converting a GGC codon to GAC. This results in a change in the 
amino acid at position 32 of tiie M smegmatis sequence firom glycine in mc^lSS to aspartic 
acid in mutant 49 (Figure 11). 

Determination of MSH, GlcNAc-Ins and GlcN-Ins. 

[0245] RepUcate cell pellets (-100 mg wet weight) were extracted and labeled witii 
mBBr for thiol analysis as described previously. Parallel samples were extracted at 60 in 
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50% acetonitrile containing 20 mM HEPES pH 8.0 for GlcNAc-Ins and GlcN-Ins analysis. 
The 50% acetonitrile extract was reduced under vacuum to remove acetonitrile from the cell 
free extracts. A zero time sample (20 pi) was removed, recombinant Rvl 170 (MshB; 100 
Hg/ml) added, and the samples incubated at 37 'C. Aliquots (20 \d) were removed after 30, 
60, and 90 min; all samples ware quenched by addition of an equal volume of acetonitrile 
containing 10 mM NEM and 2 mM 1,10-phenantbroline and heating at 60 "C for 10 min. 
The samples were derivatized with AccQ-Fluor® reagent (Waters) and analyzed by HPLC 
for the AccQ derivative of GIcN-Ins as previously described. The zero time samples were 
used to determine the level of GlcN-Ins in the cells. The addition of the MshB releases 

GlcN-Ihs from GlcNAc-Ins and provides a determination of GlcN-Ins plus GlcNAc-Lis. 

Since calculation of the value for GlcNAc-Ins requires subtraction of the GlcN-Ins level 

from this value, the GlcN-Ins level is only analyzed reUably when its value is a significant 

fraction ie.g. >10-20%) of the GlcN-Ins level. 

Antibiotic sensitivity testing 

[0246] Mutants and complements were grown m Middlebrook 7H9media supplemented 
with OADC and the appropriate antibiotics. The cultures were diluted to O.D.600mn=0.4 
and incubated at 37°C for 2-4 hours before being swabbed on Middlebrook solid media 
supplemented with OADC and the appropriate antibiotics. Tn the case of the conq)lements, 
mshA::Tn5 pAL0486 and 49::pAL0486, 1% acetamide was also added to the soUd media. 
After swabbing, the cultures were allowed to dry for 1 5 minutes and the Estrip ( AB 
Biodisk) was gently laid onto the plates. The plates were mcubated for 2-3 days before the 
MIC was determined. 

Sequence analysis. 

[0247] Preliminary sequence data for M. smegimtis was obtained from The Institute for 
Genomic Research website (at worldwide web address tigr.org). Sequences of MshA were 
obtained from GenBank with accession numbers as follows: M. tuberculosis Rv0486, NP- 
215000;.M leprae ML2143, NP-302584; Streptomyces coelicolor A3 2SCD46.18, 
NP_628379; Corynebacterium glutamicum NCgl0389, NP-599648. Sequence alignment 
using the Clustal W algorithm (J.D. Thompson et al. (1994) Nucleic Acids Res 22:4673-80) 
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was performed with Vector NTI Suite for Macintosh version 5.3.0. Results of the sequence 
alignment are shown in Figure 1 1 . 

RESULTS 

An M. smegmatis transposon mutant deficient in GlcNAc-Ins and MSH production 
iMshA::TnS). 

[0248] A Tn5 transposon library enriched for MSH-deficient mutants was produced 
using the EZ: :TN<KAN-2>) system from Epicentre and selecting for resistance to both 
kanamycin and isoniazid, the latter resistance havmg been established as a phenotype which 
is characteristic of MSH-deficient strains. Three mutants from the Tn5 transposon Ubrary 
were found deficient in MSH. One was tentatively designated mshA.-.TnS because it was 
found to produce no measurable amount of GlcNAc-Ins or GlcN-Ins (Table 8) which 
suggested that it was defective in MshA. 

Table 8: Levels of mycothiol and its precursors in mshA mutants and their 

complements 



M smegmatis 
strain 


Cellular level f umol per g residual drv weight) 


GlcNAc-Ins" 


GlcN-lhs" 


MSH" 


mc^l55 

mshA::Tn5 

/njM::Tn5pAL0486 

clone I*' 

clone 2*^ 
49 


<0.2 
<0.01 

0.10±0.02 
0.053±0.012 
<0.001 


l.OiO.2 
<0.01 

0.49±0.05 
0.64±0.03 
<0.001 


10±3 
<0.01 

9.4±0.4 
9.6±0.2 
<0.01 


49::pAL0486 
clone 1*^ 
clone 2" 


0.25±0.09 
0.15±0.04 


1.3±0.2 
0.84±0.08 


11.5±0.3 
11.1±0.3 



-VLean and range of duplicate samples of exponential cells cultured in Middlebrook 7H9 medium 
containing 1.0% glucose and 0.05% Tween 80 with 75 ng/ml hygromycin and 20 fig/ml kanamycin 

?>2OT^dby published methods for MSH (Koledin, et al., Arch. Microbiol. 178:331-337 
(2002)). GlcNAc-Ihs (Buchmeier, et al., Mol. Microbiol., 47:1723-1732 (2003).), and GlcN-lhs 

'Cells were cultured as above to ODeoo = 0.5, transferred to Middlebrook 7H9 medium without 
glucose, and induced for 20 h at 23 "C with 0.2% acetamide. 
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The mshA gene encodes a glycosyltransferase. 

[0249] In order to obtain high quality sequence for the genomic DNA of MshA::Tni at 
the site of insertion, it was necessary to subclone the transposon and adjacent genomic DNA 
into pUC18 using either Saa or Pstl (Figure 13). The SaK clone produced 205 bp of 
sequence and the Pstl clone produced 822 bp of sequence, both at the same insertion site, 
the Pstl clone corresponding to the complement of bases 507551-508372 of contig 3311 of 
the unfinished M.smegmatis genome at The Listitute for Genomic Research (on the world 
wide web at tigr.org). Since tiie Msmegmatis sequence has not yet been amotated, the 
experimental sequence was used to search the M. tuberculosis H37Rv genome database at 
GenBank using tblastx. This identified a 236 amino acid residue sequence ftom ORF 
Rv0486 having 84% identity with the translated experimental sequence. Having established 
the reading flame for the M smegmatis sequence, a downstream stop codon defined the 
temiination site for the M. smegmatis gene. The start position for the gene was taken to be 
defined by a GTG codon found upstream fipom the end of the region with high identity to the 
M tuberculosis sequence and downstream fix)m an in-fiame stop codon. This identified the 
sequence for MshA as shown in Figure 11. The M. smegmatis and M. tuberculosis 
MshA sequences are 75% identical in a 446 residue overlap. 

The Af. tuberculosis MshA gene (Rv0486) complements mshAiilnS and chemical 
mutant 49 

[02501 To verify that Rv0486 encodes the enzyme activity missing in the transposon and 
chemical mutants, the gene was cloned into the pALACE vector to produce pAL0486, 
which was used to transform mshAiiTnS and mutant 49. M. tuberculosis H37Rv genomic 
DNA was prepared as described previously. The open reading firame (ORF) Rv0486 was 
amplified by PGR using this DNA and primers 

486 PAL 3 (5'CATATGCACGGTCGGCAAGGAGG3') (SEQ ID NO: 25) and 

486 PAL 5 (5'AGGATCCATGGCAGGTGTGCGGCAC3') (SEQ ID NO: 26). 

These primers were designed to contain Ndel and BarntD. restriction sites, respectively. 
PGR was performed as described earUer and the appropriate PGR product was Ugated into 
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pALACE to obtain the plasmid pAL0486. Tbis pAL0486 plasmid was electroporated into 
the MSH deficient mutants as previously described and selection was perfonned on plates 
containing hygromycin for mutant 49 and hygromycm plus kanamycin for mshA::TTi5 
(Newton, etal, 1999, supra.) 

[0251] In each case, two clones were selected &om the hygromycin plates for growth in 
liquid culture to produce cells for analysis of MSH and its precursors. The results (Table 8) 
demonstrated that full restoration of MSH production to wild type levels occurs with 
acetamide induced cultures and levels of GlcNAc-Ins and GlcN-Ins increase to normal or 
near-normal levels in both mutants. This confirms that the loss of MSH biosynthesis 
capacity in mutant fitshA '. fTnS results solely fi»m inactivation of the mshA gene and 
demonstrates that mutant 49 is defective in the mshA gene, 

[02521 To ascertain fhs nature of die defect in the mshA gene of mutant 49, the mshA 
genes from the parent strain, M. smegmatis mchSS, and mutant 49 were cloned and 
sequenced. The M smegmatis sequence obtained fixsm The Institute for Genomic Research 
was used to design primers upstream of the start codon (49seq 5'; 
5'GCAACGAGAAGGCCGTCGAACT3') (SEQ ID NO: 27) and downstream of the 3' 
region (49seq 3'; 5'GTCCTCGATGATCTTCCTGACA3') (SEQ ID NO: 28) of theM 
smegmatis mshA gene. The primers were used to amplify the DNA 6om M. smegmatis 
mchSS and from two different colonies of mutant 49 and each amplified band was cloned 
into pCR2.1 (Invitrogen). After ensuring by restriction digestion that the M. smegmatis 
mshA had been cloned, the DNA was sequenced using primers 49seq 5' and 49seq 3' as 
well as the universal primers T7 and M13R. To sequence the internal region of the 
homolog and confirm the missense mutation, the same procedure was followed with primers 

49MED 5' (5'GCGTGGCGGTGTTGTCGGTA3') (SEQ ID NO: 29) and 

49MED 3' (5'GACCAGTTGTTCGCGGCTCT3') (SEQ ID NO: 30). 

Comparison of the sequences revealed a single base pair change in the mutant converting a 
GGC codon to GAC. This results in a change in the amino acid at position 32 of the M. 
smegmatis sequence from glycine in mc^l55 to aspartic acid in mutant 49 (Figure 11). 
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Orthologs of msItA are present in diverse MSH-producing actinomycetes. 

10253] Complete genomes are available for several mranbers of the actinomycetes 
belonging to genera previously shown to produce mycothiol (Newton, et aL, (1996) J. 
Bactenol 178:1990-1995.). These include M leprae, Streptomyces coelicolor A3i2), and 
Corynebacterium glutamicum, each contaming a gene that translates with a high degree of 
identity to the M tuberculosis and M smegmatis MshA proteins (Figure 1 1). The mshA 
mutants have altered antibiotic sensitivity compared to the parent strain. 

[0254] The ortholog from M leprae is 82% identical to the M. tuberculosis sequence 
whereas the evolutionarily more distant S. coelicolor and C. glutamicum sequences are 
respectively 49 and 47% identical. The sensitivity of the M smegmatis parent strain 
mc^lSS. transposon mutant mshAiiTnS and chemical mutant 49 to isoniazid was tested 
using Estrips (Oxoid) and assessing inhibition of growth after 2-3 days at 37°C. The MIC 
values determined were respectively 1, >250, and >250 ng/ml for isoniazid. The MIC 
values for mc^l55 and mutant 49 are similar to those found earUer for isoniazid, 2 and >50 
ixg/ml, by plating on antibiotic containing media (Newton, et al., 1999, supra.). When the 
isoniazid sensitivity of the complemented mshA::Tn5 and strain 49 mutants was tested on 
plates containing 1% acetamide but lacking glucose, the MIC values were 2.7 and 28 |ig/ml, 
respectively, demonstrating substantial reversion to the parental phenotype. The 
mecham'stic basis for the isoniazid resistance has not been estabUshed but there is evidence 
indicating that the isoniazid sensitivity is specifically linked to MSH rather than a generally 
high level of cellular thiol (Koledin, 2002, supra.). 

EXAMPLE 12 
CHARACTERIZATION OF M.tuberculosis MshC 

Reagents 

[0255] Middlebrook 7H9 was purchased from Difco Laboratories, and glucose and 
Tween 80 were from Fisher. MSH was isolated from Af. smegmatis as described (Unson, et 
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al, (1998) J. Immunol. Meth. 214, 29-39.) and the monobromobimane (mBBr, Molecular 
Probes) derivative (MSmB) was prepared and purified by the method of Newton, et al. 
(1995) Methods Enzymol. 251, 148-166. GlcN-Ihs was prepared by the quantitative 
hydrolysis of MSmB by purified M smegmatis mycothiol S-conjugate amidase as 
previously described (Newton, et al. (2000) Biochemistry 35, 10739-10746.). CySmB- 
GlcN-Ins was purified by preparative HPLC. after acid hydrolysis of MSmB, as described 
(Anderberg, et al. (1998) J. Biol. Chem. 273, 30391-30397.). 

Analysis of MSH and the precursors GlcN-Ins, GlcNAc-Ins 

[0256] Cells were extracted and derivatized with mBBr for thiol analysis as previously 
described (Koledin, et al. (2002) Arch. Microbiol. 178, 331-337.). The mycothiol 
precursors, GlcN-Ins and GlcNAc-Ins, were measured by the method of Buchmeier, et al. 
(Buchmeier, et al. (2003) Afo/. Microbiol. 47, 1723-1732.). 

MshC ligase assay 

[02571 The standard protocol for determination of MshC activity during enzyme 
purification was that described by Sareen, et al., (2002) Biochemistry 41 , 6885-6890. For 
kinetic studies with tiie natural substrates (ATP, L-Cys and GlcN-Ins) the concentiation of 
one substrate was varied, keeping the other two constant in the presence of -100 ng of 
purified M. tuberculosis MshC. Protease mhibitors were omitted in the kinetic stiidies. 
Alternative substrates to Cys were analyzed at 80, 200, 800 and 1600 /M with 50 /M 
GlcN-his and 1 mM each of ATP, MgCh and DTT in 50 mM HEPES pH 7.5 containing 
-100 ng of purified MshC. The reaction mixtares were incubated at 37 ''C and sampled at 4 
and 40 min. For the substates containing the thiol group, the assay mixtinre was derivatized 
with mBBr by the standard derivatization procedure and assayed for the corresponding thiol 
product (i.e. MSH from AcCys). For the non-thiol substrates (e.g. L-alanine) the MshC 
Ugase activity was determined by assay of AMP production as described below. 

Cys-tRNA synthetase assay 

[02581 The cys-tRNA syntiiase activity of MshC was examined using a modification of 
the methods of Schrier and Schimmel (Schreier. et al. (1972) Biochemistry 11, 1582-9.). 
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Purified E. coli cys-tKNA synthetase was used as a positive control for this reaction and 
was a generous gift fcom Kirk Beebe and Paul Schimmel of The Scripps Research Insitiute, 
La Jolla, California. Previous studies indicate that mycobacterial tRNA synthetases will 
charge E. coli tRNAs (Kim, et al. (1998) FEBSLett 427, 259-62.). Measurement of the 
formation of tRNA^ys was determined by the separation of fi«e l^c-cysteine &om l^C- 
tRNA^ys by the filtration of TCA precipitates (Schrier and Schimmel, 1972, si^ra). Acid 
precipitated counts are assumed to be l^C-tRNA^ys and control reactions without tElNAs 
were used to estimate background filter counts. 

[0259] Whatman GF/C 25 mm glass fiber filters were prewashed in 7% TCA and dried 
prior to use. E. coli cys-tRNA synthetase (15 |ig) or purified M tuberculosis MshC (34 jig) 
were assayed in 2 mM ATP, 4 mM MgCh, 20 mM DTT, 20 mM KCl, 0.1 mg/ml bovine 
serum albumin (Sigma), 10 mg/ml E. coli tRNAs (Boehringer Mannheim), 60 ^M l^C- 
cysteine (18 nCi/jmiole,Perkin Ehner), and 50 mM HEPES pH 7.5, final concentrations in 
an assay volume of 100 \il Reactions were mcubated at 23°C and 18 ^1 aUquots were 
removed j&om the reaction at 1, 3, 6, 9, and 15 min and mixed with 1 ml of aqueous 7% 
TCA These samples were incubated for 10 min at 23 *C and vacuum filtered. The sample 
filters were washed with 1 ml 7% aqueous TCA followed by 5 ml 95% ethanol and dried in 
a vacuum oven at 50 °C. Dried filters were counted in 9 ml Econo-Safe scintillation 
cocktail (Research Products Intemational) in a Beckman model LS1701 scmtillation 
counter. 

AMP assay 

(02601 The formation of AMP was assayed by HPLC with some changes in the method 
described by Beuerle, et al (2002). The Ugase reaction mixture (100 /£.) was terminated by 
the addition of NEM to 2 mM followed by 2 nL of 5 N methanesulphonic acid; it was 
immediately ftozen in dry ice. Analysis for AMP and ATP analysis conducted by HPLC on 
a Beckman Ultirasphere IP (250 x 4.6 mm) analytical column fitted with Brownley OD-GU 
5 nC-18 cartridge using a flow rate of 1 mL/min and the following Unear gradient: 0 min, 
0.1% B ( 77 mM KH2PO4, 2.2 mM tetrabutylammonium hydroxide, 38.5% methanol, pH 
5.5); 40 min, 100% B; 42 min, 100% A (10 mM KH2PO4 ,10 mM tetiabutylammonium 
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hydroxide, 0.25% methattol, pH 7.0); 50 min, 100% A (reinjection). The nucleotides were 
detected at 260 mn on a Waters 486 UV detector. 

Inhibition studies 

[02611 To test for inhibition of MshC Ugase activity, ~1 00-150 ng of the purified 
enzyme was incubated with different concentrations of inhibitor in 50 mM HEPES pH 7.5, 
for 30 min. at room temperature followed by the sequential addition of 1 mM each of DTT, 
ATP and MgCU, 70 jjM L-Cys, and 50 GlcN-Ins in a final volume of 30 /iL. Aliquots 
(12.5 nL) were withdrawn at 2 min. and 4 min for derivatization with mBBr and HPLC 
analysis as described earUer (Sareen, et aL, 2002, supra.). For MSH and Cys-GlcN-Ins 
inhibition analysis, the second substrate GlcN-Ins was used at a concentration near the Km 
value (300 fM) found in this study. The protocol was modified for the product (Cys-GlcN- 
his) inhibition studies to allow measurement of initial rate in the presence of ~5 /M added 
Cys-GlcN-Ins. The enzyme level was reduced 10-20-fold to produce a rate capable of 
producing a measurable increase in Cys-GlcN-Ins. 

Metal chelation by phenanthrolines 

[0262] Stock solutions of 1,10-phenanthroline (Kodak) and 1 ,7-phenanthroline (Aldrich) 
were prepared in dimethylsulphoxide. Purified amidase (90 ng) in 42 ;zL of assay buffer 
was incubated with 0.1-5 mM (n=4), of phenanthrolines at room temp, for 10 min. The 
Ugase reaction was initiated by the sequential addition of reaction components i.e. 1 mM 
each of ATP, MgCh and DTT with 1 mM L-Cys and 600 pM GlcN-Ins. After 10 min. of 
incubatibn at 37°C, the reaction was stopped and derivatized by the addition of 8 mM mBBr 
in acetonitrile, and acidified by 10 mM methanesulfonic acid followed by HPLC analysis as 
described earlier (W.). 

Zn supplementation 

[0263] Purified MshC (13 1 ng) was incubated with 2, 10, 50 or 100 /M zinc chloride in 
25 of the assay buffer at room temp or at 37 •C. The hgase reaction was initiated by the 
sequential addition of 1 mM each of ATP, MgCU and DTT, 100 fM L-Cys and 50 fM 
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GlcN-Ins. At 2 and 4 min 12.5 //L aUquots were withdrawn and analyzed for thiol content 
as described above. 

Cloning of ntshC icysS2) in pACE. 

[0264] mshC/R.v2l3Qc was earlier cloned in pRSETA into BamHl I HindSS. sites under 
T7 promoter in E. coli BL21 DE3 pLysS (Sareen et al., 2002, supra). While there was 
activity in the soluble fraction, the bulk of the protein was found to accvunulate in the form 
of insoluble inclusion bodies. Attempts to solubiUze and reactivate the protein proved 
unsuccessful. Cloning of the native MshC protein and expression in M. smegmatis as a 
more suitable host was therefore examined. 

[0265] The expression was then tried in M. smegmatis under the acetamidase promoter 
of pACE (DeSmet et al., (1999) Microbiol. 145, 3177-3184) and pALACE (Koledin et al., 
2002, supra) vectors, at the restriction sites, BamHl and Clal. The mshC gene in pRSETA 
was restriction digested with BamH I and Hind m, subcloned in the vector pSODrr-2 at the 
two respective sites. The gene was then digested from this vector with BamHl and Clal and 
was further subcloned at the respective sites in pACE. The M tuberculosis mshC 
(Rv2130c) was cloned into pACE, a shuttle plasmid for E. coli and mycobacteria, having a 
cloning site downstream of an inducible M. smegmatis acetamidase promoter ODe Smet, 
1999, supra.) to produce pACE::wsAC (Figure 14). The pACE::MshC was used to 
electrotransform the M. smegmatis 164 mutant, which is deficient in MshC activity (Rawat, 
et al. (2002) Antimicrob. Agents Chemother. 46, 3348-3355.). M smegmatis mc2l55 
contains native MshC protein, which is translated in more than one form and would 
contaminate the recombinant M tuberculosis MshC protein. Thus, a mycothiol mutant, 164, 
deficient in MshC was used as a host for expression of M tuberculosis mshC. Strain 164 is 
deficient in MshC ligase activity due to a Leu205Pro amino acid substitution resulting from 
a single point chemical mutation (Rawat, et al., 2002, supra) and produces much reduced 
levels of mycothiol (Table 9). 

[0266] The electrocompetent cells of mutant 164 were made by repeated washing of the 
cells cultured to exponential phase (A<ioo = 0.5), with sterile 10% glycerol. After 
electroporation, the cells were supplemented with one mL of 7H9 + 1% Glucose and 
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shaken at 3TC for 4 hours before plating onto 7H9+1% glucose plates, supplemented with 
75p.g/mL hygromycin. 

Growth of recombinant MshC culture and pACaEMshC expression 

[02671 M smegmatis MshC mutant 164 complemented with M tuberculosis mshC 
(Rv2130c), hereafter denoted I64::pACE/7is:/iC, was grown in 7H9 medium supplemented 
with 0.05% Tween 80, 10 % OADC (BBL) and hygromycin (75 jag/ml) at 37°C and 250 
ipm. The culture was propagated on a large scale in the same media but with 1% glucose 
instead of 10% OADC. The induction was initiated at A«)o = 0.3 with acetamide by 
centrifuging the cells at 8000 g for 15 min and resuspending them in a new media without 
glucose and instead, 0.4 % acetamide as the carbon source, while maintaining the antibiotic 
selection pressure. After 28 h of cultivation at 37»C and 250 rpm. the bacterial cells were 
collected by centrifugation at 8000 g for 15 min. The ceU pellets, about 2.3 g^ter. were 
stored at -70°C until fijrther use. 

[0268] Overexpression of the MshC Ugase in pACEmAC transformed M. smegmatis 
strain 164 with acetamide induction yielded protein in the soluble fraction of the cell-free 
extract Following induction of pACEmAC by 0.4% acetamide in the mutant strain 164, the 
MSH content was measured and found to be complemented to a level 150% that of the wild 
type strain mcHsS (Table 9). The MSH biosynthesis intennediates, GlcN-Ins and GlcNAc- 
Ins, were found to accumulate in mutant 164 to a level ~20-fold higher than that of the wild- 
type strain. Upon MshC induction in the complemented 164 strain the levels of both MSH 
precursors dropped to values typical of the wild-type strain and the MSH content increased 
150-fold to a value above the wild-type level (Table 9). 

Table 9: Mycothiol and precursor levels in M. smegmatis parent and mutant 

strains 

M smegmatis strain cellular content (|xmol/(g re sidual dry weight)) 

GlcNAc-Ins GlcN-Ins MSH 

mc^l55 (parent) 

mutantI64 4±2 19±1 0.1 
mutant I64::pACEwjAC ^:05 QA±0.l 11±1 
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Purification of recombinant ligase 

[0269] All operations were carried out at 4°C in the presence of 3 mM 2- 
mercaptoethanol and 5 mM MgCh unless stated otherwise. Twenty grams of 
I64::pACEwj/iC cells (wet wt.) from 9 liter broth was suspended in 80 ml of 50 mM 
HEPES buffer (25% ceU suspension), pH 7.5 in the presence of 35 jiM of the protease 
inhibitors TPCK and TLCK. The cells were disrupted by ultrasonication (Branson Sonifier 
200) in an ice batiL The cell dd>ris was removed by cratrifugation at 100,000 g for 1 h at 
4»C. The supernatant obtained was used as the source of the enzyme. The cell free extract 
thus obtained was subjected to 20% ammonium sulfate precipitation in ice, for 2 h followed 
by centrifugation at 28,000 g for 30 min. The supernatant was further subjected to 20%- 
45% aminonivun sulfate precipitation in ice for an overnight and the precipitated proteins 
were pelleted by centrifugation at 28,000 g for 30 min. The protein pellet (4.8 g) was 
resusupended in 48 ml of the 50 mM HEPES, pH 7.5 containing 35 pM of TPCK and 
TLCK and was desalted by passing it throu^ Sephadex G-25 column. 

[02701 The 150 ml material obtained from G-25 column was applied on DEAE 650-M 
column (5.2 x 10.6, 225 ml) preequiUbrated with 50 mM HEPES, pH 7.5. The enzyme was 
eluted at 0.2 M NaCl by running a linear gradient of 0-0.5 M NaCl in 15 colunm volumes of 
the buffer at 300 ml/h (Figure 1 5 A). The fractions containing the enzyme activity were 
combined (235 ml) and were diluted to twice the volume (480 ml) with MilU-Q water to 
lower the salt concentration. 

[0271] The diluted solution was appUed to a Bio-gel HTP column (2.6 x 1 1 .3 , 60ml) at 
120 ml/h, which was pre-equilibrated with 10 mM potassium phosphate buffer, pH 6.8 and 
100 mM NaCl. The bound proteins were eluted with a linear gradient of 10 mM to 100 mM 
phosphate concenfration and 100 to 0 mM NaCl concentration in 15 colunm volumes. The 
active fractions were collected (76 ml) and fractions 29, 31, 33, 34, 35, 36, 37 were 
analyzed for purity on 12.5% SDS-PAGE. There were few impurities left (Figure 15B), so 
fractions 29-37 were pooled, precipitated with 80% ammonium sulfate, and taken up in 50 
mM HEPES buffer pH 7.5 for gel filtration chromatography on Sephacryl-200 column (247 
ml) at 10 ml/h in 50 mM HEPES. pH 7.5 and 150 mM NaQ. (Table 10) Fractions 49-54 
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were analyzed on SDS-PAGE before pooling. The pooled enzyme was concentrated in 10 
kD membrane filters (Sigma) and stored in 50% glycerol at -70«»C in 30 aUquots for the 
detailed characterization studies. 

[0272] Amidase activity was found to be soluble in the cytoplasmic fraction on 
sonication of the cells and the level of activity was found to be ~400-fold greater than in M. 
smegmatis mc^l55 (Sareen, et aL, 2002, supra.) the parent strain of mutant 164. The 
recombinant protein eluted as a single peak with parent Mr = 34 kD on the S-200 column 
(actual Mr 45,591). Thus, the M. tuberculosis MshC protein exists as a monomer in its 
native form. This contrasts with its ortholog fix)m M. smegmatis, which forms dimers and 
tetramers in the native state (Sareen, et al., 2002, supra.). 

Table 10: Purification of M. tuberculosis Cys:GlcN-Ins ligase (MshC) 



Step 



protein" 
(mg) 


total activity 
(imiol min-l) 


specific activity 
(nmol min-^ mg-1) 


yield 
(%) 


Purif. 
fector 


1670 


14600 


8.7 


(100) 


(1) 


540 


14000 


26.0 


96.0 


3.0 


94 


2590 


27.5 


17.7 


3.7 


10 


1076 


108 


7.4 


12.4 


0.6 


93 


155 


0.64 


17.8 



Crude extract 
20-45% SAS 
DEAE ion exchange 
Hydroxyl apatite 
S-200 gel filtration 

"Protein concentration based upon A280 value, where 1 AU = 0.58 mg/ml for purified MshC. 
Metal analysis 

[02731 The purified enzyme was analyzed by Inductively coupled plasma-atomic 
emission spectroscopy (ICP) at the San Diego Gas & Electric Environmental Analysis 
laboratory for 26 metal ions; Al, Sb, As, Ba, Be. B, Cd, Ca, Cr, Co, Cu, Fe, Pb. Mg, Mn, 
Mo, Ni, K, Se, Si, Na, Sr,Th,Ti,V and Zn. The enzyme sample was diluted to 1 mg/ml and 
buffer was also submitted for background metal analysis. 

[02741 The ICP showed that there is 0.7 mol of Zinc ion/mol of enzyme. Attempts to 
inactivate the enzyme by chelating the metal present with 1,10-phenanthroline produced no 
significant loss of activity (Figure 16) and supplementation of the enzyme with Zn^* led to a 
loss in activity (Figure 16). Thus, if Zn^"" is required for activity, it must be quite tigihtly 
bound and not easily removed or complexed by 1,10-phenanthroline. In addition there must 
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be residues capable of binding Zx?* in a feshion which distorts or blocks the active site in a 
fashion which interferes with enzyme activity. 

Testing of MshC for Cys-tRNA Ugase activily 

[0275] The initial reaction of cysteinyl-tRNA synthetase and MshC is the formation of 
enzyme bound AMP-cys. Since E. coli cys-tEtNA synthetase and M. tuberculosis MshC 
(cysS2) are homologs, they have been used to focus on the second reaction, the tRNA 
charging reaction of cys-tRNA synthetase and MshC. It was found that the reactions were 
complete in 1 min at room temperature and gave stable filter counts after that time. The 
rates for the formation of tRNAcys were 1 1 ± 2.5 (n=5) and 0 ± 0.2 (n=4) nmole/min/mg 
protein for E. coli cys-tRNA synthetase and Af. tuberculosis MshC, respectively. Thus M 
tuberculosis MshC, annotated as cysS2 or cystdnyl-tRNA synthetase 2, will not charge E. 
coli tRNAs in this assay. The foregoing shows that the activity of MshC is the ATP 
dependent formation of Cys-GlcN-Ins, an intermediate in the mycothiol biosynthesis 
pathway, and not the synthesis of tRNA^ys. 

Stoichiometry of the reaction catalyzed by MshC 

[0276] Purified MshC was utiUzed to establish the stoichiometry of tiie reaction 
catalyzed. It had been initially assumed that the ATP-dependent production of Cys-GlcN- 
Ihs yielded ADP as a coproduct by analogy with the enzymology of y-glutamylcysteine 
biosynthesis, the mtermediate precursor of glutathione (Newton, et al. (2000) J. Bacteriol. 
182, 6958-6963; Bomemann, et al. (1997) Biochem. J. 325, 623-9.). However, when MshC 
was identified as a homolog of cysteinyl-tRNA synthetase (CysS), where the overall 
reaction produces AMP plus pyrophosphate, it appeared likely that the product of the MshC 
catalyzed reaction was also AMP. To verify this the levels of Cys, ATP, AMP and Cys- 
GlcN-Ins were monitored over 60 min in a reaction initiated with 100 /M Cys, 100 /M 
GlcN-Ins, and 200 /M ATP. The results (Figure 15) show that AMP is indeed the product 
derived from ATP. After 60 min the reaction was 30% complete and for each equivalent of 
Cys utilized 0.82 equivalents of Cys-GlcN-Ins was produced, accompanied by the 
utilization of 1.19 equivalents of ATP and the production of 0.95 equivalents of AMP. This 
estabUshes the reaction stoichiometry as indicated in Figure 1. 
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Enzyme kinetics and substrate specificity 

[0277] The factors influencing the enzymatic assay of MshC activity at 37 C were 
explored in 50 mM HEPES buffer, pH 7.5 containing 1 mM DTT. With 100 /M GlcN-his, 
0.5 mM Cys, and 0.5 mM ATP as substrates, the reaction rate increased sharply with Mg^* 
concentration up to 100 jlM, then leveled and became constant fiom 1-5 mM Mg^*. Under 
the same conditions but with 5 mM Mg^* and varying the ATP concentration, the rate 
increased up to 100 pM ATP, remained constant to 1 mM ATP, and then declined 2.2-fold 
at 5 mM ATP. Based upon these results 1 mM each of Mg^* and ATP were selected as 
standard assay concentrations. The apparent Km values were estimated from Eadie-Hofetee 
plots, which were linear over the indicated range of concentration. With 1 mM each of 
ATP and GlcN-Ins, the apparent for L-Cys (5 /M to 1.6 mM, n = 10) was determined to 
be 85 ± 20 /M and the apparent K™« was 1450 ± 200 nmol min"' mg For 1 mM ATP and 
0.5 mM Cys, the apparent for GlcN-Ins (10 /Al to 3 mM, n = 6) was determined to be 
280 ± 43 and the apparent was 1160 ± 120 nmol min'^ mg *. The apparent for 
GlcN-Ins of 280 ± 43 /M is about 4-fold higher than the value reported for the enzyme 
purified from M. smegmatis. 

[0278] Several thiols related to cysteine were tested as alternative substrates to Cys, each 
examined at concentrations ranging from 80 pM to 1.6 mM. The results obtained at the 
highest concentration are given in Table 11. The enantiomer, D-Cys, was a poor substrate. 
Neither p-mercaptopropionic acid nor cysteamme, derived fiom Cys by removal of the 
amino and carboxyl groups, respectively, produced evidence of reaction. Nor was 
significant activity detected with AcCys. L-homocysteine, L-serine. or L-alanine. Thus, the 
enzyme is highly specific for Cys. The specificity is less stringent for GlcN-Ms, with GlcN 
having -1% the activity of GlcN-Ins increasing linearly over the range of concentration 
studied. 

Inhibition of MshC 

[0279] Various compounds were tested over the concentration range 80 - 1600 pM as 
inhibitors of the ligase activity measured with 70 \M L-Cys and 50 nM GlcN-Ins. Results 
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for the highest level tested are sho\m in Table 1 1 . Only very minor inhibition was produced 
by the amino acids D-Cys, L-homocysteine, L-a-aminobutyiic acid, L-serine, and L- 
alanine. p-mercaptopropionic acid (deaminated Cys) and AT-acetylcysteine also produced 
minor inhibition. Of all compounds tested (he best inhibitor was cysteamine 
(decarboxylated Cys) which produced a 3-fold reduction in rate at 1.6 mM. Mycothiol also 
produced minimal inhibition at concentrations of 1 and 5 mM, representing physiologic 
levels. 

Table 11: Substrate specificity and inhibition of MshC 

compound relative spec, act." inhibition 

[inhibitor] (aM) % iphibition 



Cys (70 /M) (100)*^ 
D-Cys 

P-mercaptopropionate 



<0.7 1600 15 ± 8 

<0.004 1600 -20 ±20 

cysteamine ^-16 1600 65 ±10 

AcCys • <0.24 1600 20 ±6 

<0.2 1600 0±5 

<0.008 5000 15 ± 8 

<0.016 5000 4±8 



1000 4 ± 3 

5000 7±5 



L-homocysteme 
L-serine 
L-alanine 

D-GlcN (1.6 TcMf 1.2 
MSH 

MSH . I . 

"Ligase activity determined at 37-C with 50 pM GlcN-Ins and 1.6 mM test substrate, unless 
otherwise noted. Relative specific activity calculated firom maximal peak mtensity at 
highest test concentration and assuming linear dependence upon concentration. Inhibition 
determined with 50 fiM GlcN-Ins and 70 yiM Cys under standard assay conditions. 
^Specific activity 108 nmol min'^ mg\ ''in place of GlcN-Ins with 100 pM Cys. 

[0280] The MshC Ugase was also tested for feedback inhibition by Cys-GlcN-Ins. The 
intracelhilar level of Cys-GlcN-Ins was found to be 5-10 fM mM. tuberculosis, when 
analyzed at different growth time points (Buchmeier, Newton, Koledin and Fahey, 
unpubUshed). So, it was logical to analyze Cys-GlcN-lns in the concentration range of 1-10 
/M with levels of the Cys and GlcN-Ins substrates near their values, 70 and 300 /M, 
respectively. The apparent Ku, value for Cys of 70 ± 15 found here is nearly double tiie 
value found earUer for the M. smegmatis enzyme. Cys-GlcN-Ins produced <1.0 and 3.6% 
inhibition when tested at 0.7 fM and 6.6 fM, respectively, which shows tiiat there is no 
significant feedback inhibition at physiological levels of the reaction product. 
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EXAMPLE 13 

ESSENTIALITY OF MYCOTfflOL IN M. tuberculosis 

[0281] The use of conditional null mutants to establish essentiality in M. tuberculosis has 
not yet been accompUshed so the present example employed the general approach used by 
Parish and Stoker (Parish, et al. (2000), J. Bacteriol. 182:5715-20) to test the essentiaUty of 
the glnE. A second copy of the mshC gene was introduced into Avild type M. tuberculosis 
using an integrative vector pCV125 (kindly provided by Medhnmune) which was modified 
to contain the spectinomycin/streptomycin (Sp/Sm) cassette from pKRPlS. This vector 
containing the mshC gene has been constructed and tested on M. smegmatis strain 164, a 
chemical mutant defective in mshC and MSH production (Rawat, 2002, supra.). It was 
shown to be effective in restoring MSH production in M. smegmatis 164. pCV125 
mtegrates into the att site in the M tuberculosis chromosome and will stably introduce a 
second copy of the mshC gene into a second location of the chromosome. The mshC ORF 
plus its nbosomal binding site (71 bp upstream of the ATG start codon) was ampUfied by 
PGR using genomic M. tuberculosis (Erdman) DNA- The forward primer 
5'-TCCCCCGGGACGCGTGGCGCTGAT-3' (SEQ ID NO: 45) , contains a Smal 
restriction site, and the reverse primer 

5'-GGACTAGTCTACAGGTCCACCCCGAGCAG-3' (SEQ ID NO: 46), contains a Spe\ 
restriction site which was used for directional cloning. The PGR fragment was ligated with 
pCR 2.1(Invitrogen) using T4 DNA Ugase and used to transform TOP lOP (Invitrogen) E. 
coli. After selection on agar plates (LB, ampiciUin 100 ng/ml) and growth in broth, plasmid 
DNA was analyzed by restriction analysis and sequencing. The SmaVSpel fragment 
containing the mshC gene from this plasmid was cloned between the Smal and Spel sites 
within the aph gene in pCV125. This resulted in a vector containing a copy of the mshC 
gene that is transcribed from the aph promoter. Vector DNA was introduced mto wUd type 
M tuberculosis by electroporation with selection on 7H11 plates containing streptomycin. 
As a control, pCV125 with no extra DNA was introduced into other aliquots of M. 
tuberculosis. Streptomycin resistant colonies were grown up, chromosomal DNA was 
extracted, and the presence of 2 copies of the mshC gene were confirmed by Southern 
hybridization. Ncol digests were initially used because this enzyme cuts outside of flie 
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mshC gene and will allow for easy identification of differences in flanking sequences 
between the native copy of mshC and the introduced copy. Sad digests were also used to 
analyze the original and the introduced copies of mshC within the genomic DNA of 
transformants. 

[0282] After the presence of the second copy of the mshC gene was confirmed using 
Southern hybridization, 2X.-mshc was infected with the specialized transducing phage 
containing the mshC knockout construct. The specialized transducing phage is a variant of 
a mycobacteriophage described by Bardarov et al. (Bardarov, et al. (2002), supra; Bardarov, 
et al. (1997) Proc. Natl. Acad. Sci. 94:10961-6.) and was a gift ftom J. Cox. The 
specialized transducing phage containing the mshC knockout DNA was constnicted by 
amplifying -500 bp of the upstream fragment (protein N-teraiinal region of mshC) 
comprising 102 bp of the mshC gene and 370 bp of downstream sequence, and -500 bp of a 
middle fragment (residues 195-708) of the mshC gene. This results in a deletion of the 
residues encoding the active site "HGLH" (SEQ ID NO: 47) region of the protein. Each 
primer set incorporated suitable endonuclease sites to allow subcloning of the PGR product 
into plasmid pJSC284 such that a hygromycin resistance cassette was inserted within the 
mshC ORF. After verifying the fragment incorporation, the plasmid was digested with 
Pad, treated with alkaUne phosphatase to prevent plasmid rejoining during subsequent 
Ugation and Ugated into the Pad site of the specialized transducing phage phAE87. DNA 
was packaged into A, phage using Gigapack m Gold packaging extract (Stratagene) and this 
was used to infect HBlOl E. coli grown on maltose to promote phage uptake. Colonies 
were selected on hygromycin. Cosmid DNA was extracted from the hyg' colonies and was 
used to transform M smegmatis mc^SS. The transformation plates were incubated at 30°C 
until plaques appeared (2-3 days). Plaques were picked and a high titer phage stock was 
prepared fiomM smegmatis mc 155. 

[02831 For infection of M tuberculosis, 10 ml of bacteria was washed with MP buffer 
and resuspended in 1 ml MP buffer at 39°C (Buchmeier et al. (2000) Molec. Microbiol. 
35:1375-1382.). Phage was added at a multiplicity of infection of 10 and the mixture was 
incubated at 39°C for 4 hr to allow for phage infection. The bacteria were spun down, 
resuspended in 500 ml MP buffer and plated on 7H1 1 plates containing hygromycin (50 
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mg/ml). Hygromycin resistant colonies appeared in 3 to 5 weeks. Individual colonies were 
grown up for analysis by Southan hybridization (to verify the presence of the mshC 
knockout) and for measurement of MSH content The alleUc exchange substrate should 
recombine preferentially with the native copy of the mshC gene and not with the introduced 
copy since tiie flanking sequences of tiie introduced mshC gene differ from the flanking 
sequences oimshC'm the phage mutagenesis construct. The corA knockout phage constract 
was included as a positive control for the transduction procedures. 

[0284] In initial experiments with M. tuberculosis with only the original copy of mshC, 
the phAE^/jC phasmid produced hygromydn resistant transformants that contained 
parental mycothiol levels. This experiment was repeated several times with a total of 67 
hygromycin clones analyzed by Southern blot. All clones were found to have the original 
copy of mshC intact. Representative clones such as clone 49 (Table 12) were examined for 
mycothiol content and found to have the parental levels of mycothiol. Thus, no authentic 
nishC mutants were detected in the 67 clones examined from transformation of the parental 
strain of M tuberculosis Erdman. 

[0285] Initially the integrative plasmid deUvering the second copy of mshC was tested 
on a chemical mutant of mshC in M smegmatis (strain 164) which produces about 1% of 
the parental level of mycothiol (Rawat. et al., 2002, supra.). Incorporation of the M 
tuberculosis mshC gene resulted in the production of 150% of the parental level of 
mycothiol in this mycothiol mutant (results not shown). This same plasmid was used to 
transform M. tuberculosis Erdman to make the merodiploid host 2x mshc identified as strain 
1682 in a Southern blot. There is a single Sad site in mshC which generates 2 bands in the 
original copy observed in the M. tuberculosis Erdman (Erd) strain as well as in the second 
copy as observed in 1682, the 2x mshC host strain in another Southern blot. The mycothiol 
content 2x mshC strain is nearly twice the parental level (Table 12). One of the substrates 
of MshC, GlcN-Ins, is reduced by 64% by the presence of the second copy of mshC in the 
2x mshC strain 1682 (Table 12). This is consistent with Ugase substrate depletion due to a 
higher level of cellular MshC activity. The Southern blots taken together with the 
mycothiol analyses verify the presence of a fimctional second copy of mshC in the 2x mshC 
host strain. 
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Table 12: Mycothiol and precursor levels in M tuberculosis Erdman mshC knockout 
strains^. 





OD600 


GlcNAc-Ins 


GIcN-Ins 




Host and Control strains 




nmoles/lO" Cells 




Af. tuberculosis Jardman ypareny 


0.50 


1.7 ± 0.7 


8.9 ± 0.2 


13.7 ± 0.2 


Strain 3 


0.41 


^0.7 


7.3 ± 0.1 


12.1 ± 1.3 


1682 Bximan (+ 2nd copy MshQ 


0.85 


0.9 ±0.1 


3.2 ±0.1 


26.2 ±1.2 


Strain 49 


0.47 


1.7 ±0 


9.2 ± 0.1 


21.5 ±0.5 


mshC knockout strains in (2x mshC 


■> M tuberculosis Erdman 1682 




Strain 14 


0.36 


^1.4 


14.1 ± 0.2 


12 ± 0.3 


Strain 16 


0.37 


^0.6 


5.7 ± 0.3 


14.6 ±0.1 


Strain 18 


0.43 


^1.2 


12 ±1 


10.2 ±0.1 


Strain 157 


0.58 


1.5 ± 0.7 


12 ±1.4 


10.2 ±0.1 


Strain 158 


0.52 


1.7 ±0.4 


15 ±1 


11.8 ±0.1 


Strain 172 


0.49 


1.0 ±0.5 


8.5 ± 0.2 


11.3 ±0.2 



"Results expressed as mean and range of duplicate samples. 



[0286] This organism served as a primary host for the attempted mshC knockout 
experiments. Using this host 6 of 12 or 50% of the hygromycin resistant transformants 
were found to be authentic knockouts in the original copy otmshC (Table 13). Strains 
identified as 14, 16, 18, 157, 158, and 172 all lack the original copy of mshC, as was seen in 
a Southern blot, but do contain the second copy of the gene. These strains also make 
parental levels of mycothiol and its precursor GlcN-Ms (Table 12), demonstrating the 
second mshC gene if fully functional. In one case, strain 3, it appears that the second copy 
and not the origmal copy of mshC was interrupted by the knockout phasmid. As a positive 
control for the transformation efficiency, a transformation of M tuberculosis Erdman 
(without a second corA copy) with a phasmid carrying a knockout construct for the non- 
essential magnesium transporter corA was successful in knocking out corA in 6 of 18 or 
33% of the transformants (results not shown). An empty plasmid pCV125 control strain of 
M tuberculosis host Erdman was transformed with the mshC knockout phasmid 
(phAEAmshQ and 0 out of 7 of the transformants were authentic mshC knockouts by 
Southern blot (results not shown). Thus, no transformants can be found without mycothiol 
in any of the above experiments. Additionally no authentic mshC knockout strains can be 
found by Soutiiem analysis, exc^t when a second copy of mshC is supplied. These results 
clearly support that mshC specifically, and mycothiol generally, are essential for the 
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siirvival of M tuberculosis in laboratory culture. These experiments demonstrate the 
essentiality of mycothiol and validate the mycothiol biosynthesis genes as possible drug 
targets. . 

Table 13: M tuberculosis Erdman directed knockouts in mycothiol biosynthesis gene mshC (Rv 
2130c). 

— " ' Knockouts by Transfonnants Knockout EflBciency 

Southern blot ^camined^^ 



MAuberculosis Erdman + 
empty vector (for 2x mshC) 
(negative control) 

M.tuberculosis Erdman + 
phasmid phAEAcor4 
(positive control) 

MAuberculosis Erdman + 
phasmid phAEAwwAC 



MMberculosisBcAxaaii IxmshC 
(strain 1682)+ phasmid phAEA/w^AC 



7 0% 

18 33% 

67 0% 

12 50% 



axransformants are hygromycin resistant colonies appearing within 5 weeks. 

[0287] Although the invention has been described with reference to the above examples, 
it will be understood that modifications and variations are encompassed within the spirit and 
scope of the invention. Accordingly, the invention is limited only by the following claims. 



